JMIR Preprints #66269: Interpretable machine learning to predict the malignancy risk of follicular thyroid neoplasms in the extremely unbalanced data: Experiences from a real-world study and literature review

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Interpretable machine learning to predict the malignancy risk of follicular thyroid neoplasms in the extremely unbalanced data: Experiences from a real-world study and literature review

Rui Shan;
Xin Li;
Jing Chen;
Zheng Chen;
Yuan-Jia Cheng;
Bo Han;
Run-Ze Hu;
Jiu-Ping Huang;
Gui-Lan Kong;
Hui Liu;
Fang Mei;
Shi-Bing Song;
Bang-Kai Sun;
Hui Tian;
Yang Wang;
Wu-Cai Xiao;
Xiang-Yun Yao;
Jing-Ming Ye;
Bo Yu;
Chun-Hui Yuan;
Fan Zhang;
Zheng Liu

ABSTRACT

Background:

Diagnosis and treatment of follicular thyroid neoplasms (FTN) remains a huge challenge, as the malignancy risk of FTN cannot be ascertained until the diagnostic surgery.

Objective:

We aimed to use interpretable machine learning to predict the malignant risk of FTN before surgery in the real-world setting.

Methods:

We conducted a retrospective cohort study at the Peking University Third Hospital in Beijing, China between January 2012 and September 2023. We included patients with postoperative pathological diagnoses of follicular thyroid adenoma (FTA) or carcinoma (FTC) and excluded those without thyroid ultrasonography before surgery. We used 22 predictors of demographic characteristics, thyroid sonography, and hormones to train 5 machine learning models and chose the optimal model that performed well in model discrimination, calibration, interpretability, and parsimony. For this extremely unbalanced data (the ratio of FTA to FTC much greater than 1), we adopted both the area under receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPRC) to assess model discrimination. We also summarized the experiences derived from our study findings and a literature review of the existing evidence.

Results:

This cohort included 1543 patients (mean age: 47.98 ± 14.14 years, female: 73%) with 1676 FTN tumors (FTA: n = 1418; FTC: n = 258; the ratio of FTA to FTC: 5.5). The random forest was chosen as the optimal model, revealing that mean TSH score, mean tumor diameter, mean TSH, margin, and TSH instability were the five most important predictors in discriminating FTA from FTC, with the AUROC of 0.79 (95% CI, 0.77-0.80) and AUPRC of 0.40 (0.38-0.42). The effect of mean diameter on malignancy risk varied with TSH instability; specifically, the malignancy risk was elevated in larger tumors with greater TSH instability, whereas it was reduced in larger tumors with more stable TSH levels. The risk of malignancy tended to nonlinearly increase with a larger mean tumor diameter or higher instability in TSH levels, while it nonlinearly decreased with a higher mean TSH score or mean TSH level. FTCs with small size (mean diameter: 2.89 cm) were more likely to be misclassified as FTAs than those with greater size. Our literature review showed that (1) the ratio of FTA to FTC varied from 0.6 to 4.0 which was lower than the natural distribution of 5.0; (2) no studies have used the AUPRC to assess prediction performance in the unbalanced data; and (3) the external validation of previously derived model did not perform as well as in the original study.

Conclusions:

TSH measurements and tumor size were important in screening the malignancy risk of FTN before the diagnostic surgery. However, it remained challenging to assess the malignancy risk of small-size FTNs. Future studies are needed to overcome the prediction challenge embedded in the extremely unbalanced distribution of FTA and FTC in real-world data. Clinical Trial: This study was approved by the Medical Research Ethics Committee of Peking University Third Hospital (No. IRB00006761-M2023168).

Citation

Please cite as:

Shan R, Li X, Chen J, Chen Z, Cheng YJ, Han B, Hu RZ, Huang JP, Kong GL, Liu H, Mei F, Song SB, Sun BK, Tian H, Wang Y, Xiao WC, Yao XY, Ye JM, Yu B, Yuan CH, Zhang F, Liu Z

Interpretable Machine Learning to Predict the Malignancy Risk of Follicular Thyroid Neoplasms in Extremely Unbalanced Data: Retrospective Cohort Study and Literature Review

JMIR Cancer 2025;11:e66269

DOI: 10.2196/66269

PMID: 39930991

PMCID: 11833187

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Cancer

Date Submitted: Sep 9, 2024

Open Peer Review Period: Sep 13, 2024 - Nov 8, 2024

Date Accepted: Dec 4, 2024

(closed for review but you can still tweet)

Interpretable machine learning to predict the malignancy risk of follicular thyroid neoplasms in the extremely unbalanced data: Experiences from a real-world study and literature review

ABSTRACT

Citation

Copyright