Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jun 2, 2025
Date Accepted: Nov 7, 2025
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Scalable and Robust Artificial Intelligence for Spine Alignment Assessment: Multicenter Study Enabled by Real-Time Data Transformation

Chen G, Meng N, Zhuang Y, Chen Z, Bian Z, Gong Z, Shi J, Huang T, Kuang X, Lu P, Nie C, Yu Q, Chen Z, Jiang H, Zhang Z, Zheng C, Liang Y, Wu N, Cheung JPY, Zhang J, Zhang T

Scalable and Robust Artificial Intelligence for Spine Alignment Assessment: Multicenter Study Enabled by Real-Time Data Transformation

J Med Internet Res 2026;28:e78396

DOI: 10.2196/78396

PMID: 41861366

Scalable and Robust Artificial Intelligence for Spine Alignment Assessment: A Multicentre Study Enabled by Real-Time Data Transformation

  • Guilin Chen; 
  • Nan Meng; 
  • Yipeng Zhuang; 
  • Zhe Chen; 
  • Zhen Bian; 
  • Zhaoyang Gong; 
  • Jiawei Shi; 
  • Tao Huang; 
  • Xihe Kuang; 
  • Pengyu Lu; 
  • Cong Nie; 
  • Qifeng Yu; 
  • Zefu Chen; 
  • Hui Jiang; 
  • Zhongmin Zhang; 
  • Chaojun Zheng; 
  • Yu Liang; 
  • Nan Wu; 
  • Jason Pui Yin Cheung; 
  • Jianguo Zhang; 
  • Teng Zhang

ABSTRACT

Background:

Artificial intelligence (AI) has shown promise for automating spinal alignment assessment in adolescent idiopathic scoliosis (AIS). However, AI models typically exhibit reduced accuracy and robustness when deployed across multiple medical centres due to variability in imaging protocols and data characteristics, potentially compromising clinical diagnosis and treatment decisions.

Objective:

This study aimed to develop a real-time, plug-and-play data transformation method to enhance the robustness of deep learning models against data heterogeneity in radiographs, thereby improving their performance in assessing AIS across multiple medical centres.

Methods:

In this retrospective multicentre study, 4,111 full-spine radiographs from seven hospitals (two from Hong Kong and five from mainland China), collected between January 2012 and August 2024, were included. Data from two hospitals in Hong Kong (n=3,034) were used for model training and internal validation, while radiographs from the five mainland hospitals (n=1,077) formed five independent external validation datasets. A novel pixel-intensity-based data transformation method was developed to standardize image contrast and brightness across datasets and integrated into the model training process to enhance our previously developed AI model, SpineHRNet+. The enhanced model's accuracy and robustness for Cobb angle prediction and severity classification were evaluated using both internal and external datasets. Data heterogeneity across centres was quantified by brightness and contrast differences. Cobb angle prediction accuracy was evaluated using residual analysis, linear regression (coefficient of determination [R²]), and Bland-Altman analyses. Model performance for disease severity classification was assessed using sensitivity, specificity, precision, negative predictive value (NPV), accuracy, and confusion matrix analysis.

Results:

In this retrospective multicentre study, 4,111 full-spine radiographs from seven hospitals (two from Hong Kong and five from mainland China), collected between January 2012 and August 2024, were included. Data from two hospitals in Hong Kong (n=3,034) were used for model training and internal validation, while radiographs from the five mainland hospitals (n=1,077) formed five independent external validation datasets. A novel pixel-intensity-based data transformation method was developed to standardize image contrast and brightness across datasets and integrated into the model training process to enhance our previously developed AI model, SpineHRNet+. The enhanced model's accuracy and robustness for Cobb angle prediction and severity classification were evaluated using both internal and external datasets. Data heterogeneity across centres was quantified by brightness and contrast differences. Cobb angle prediction accuracy was evaluated using residual analysis, linear regression (coefficient of determination [R²]), and Bland-Altman analyses. Model performance for disease severity classification was assessed using sensitivity, specificity, precision, negative predictive value (NPV), accuracy, and confusion matrix analysis.

Conclusions:

The proposed data transformation approach effectively addressed data heterogeneity, significantly improving the accuracy and robustness of SpineHRNet+ in multicentre AIS assessments. The real-time processing capability and preservation of anatomical integrity underscore the method’s clinical practicality, enabling scalable and reliable AI applications in diverse healthcare environments. Clinical Trial: Name: Artificial Intelligence-based Models for Spine Malalignment Auto-analysis ClinicalTrials.gov ID: NCT06711757 URL: https://clinicaltrials.gov/study/NCT06711757?cond=NCT06711757&rank=1&tab=results


 Citation

Please cite as:

Chen G, Meng N, Zhuang Y, Chen Z, Bian Z, Gong Z, Shi J, Huang T, Kuang X, Lu P, Nie C, Yu Q, Chen Z, Jiang H, Zhang Z, Zheng C, Liang Y, Wu N, Cheung JPY, Zhang J, Zhang T

Scalable and Robust Artificial Intelligence for Spine Alignment Assessment: Multicenter Study Enabled by Real-Time Data Transformation

J Med Internet Res 2026;28:e78396

DOI: 10.2196/78396

PMID: 41861366

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.