Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Oct 2, 2025
Open Peer Review Period: Oct 15, 2025 - Dec 10, 2025
Date Accepted: May 8, 2026
(closed for review but you can still tweet)
Multimodal Prediction of Renal Tumor Malignancy from Radiology Reports and Structured Electronic Health Records: Retrospective Cohort Study
ABSTRACT
Background:
Accurate preoperative prediction of renal tumor malignancy is essential but remains challenging. While radiology and structured electronic health record (EHR) data are widely used for tumor evaluation, radiology reports—though rich in diagnostic information, have been relatively underutilized due to extraction challenges.
Objective:
To enhance malignancy prediction of renal tumors by developing a multimodal pipeline that integrates structured EHR data with information extracted from radiology report text using advanced large language models.
Methods:
We developed a predictive framework that integrates structured EHR variables with features derived from radiology reports. Large language models (LLMs) were used to extract abnormality characteristics, and a pretrained biomedical transformers (i.e., RadBERT) was applied to generate contextual embeddings from unstructured radiology text. These textual features were fused with structured data using early, middle and late fusion strategies. Model performance was evaluated using standard classification metrics, including accuracy, precision, recall, specificity, area under the ROC curve (AUC) and F1-score.
Results:
Incorporating RadBERT-derived textual features improved prediction performance, while LLM-extracted abnormality characteristics contributed modest gains. Among fusion strategies, early fusion achieved the highest AUC of 0.818 (± 0.010), and late fusion provided the best F1-score of 0.779 (±0.022), both outperforming unimodal baselines.
Conclusions:
Our findings demonstrate the value of leveraging unstructured radiology reports through advanced natural language processing (NLP) techniques for malignancy predication. This multimodal fusion approach enhances preoperative renal tumor assessment and has the potential to reduce unnecessary surgeries and improve patient care.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.