Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR AI

Date Submitted: Dec 4, 2024
Open Peer Review Period: Dec 23, 2024 - Feb 17, 2025
Date Accepted: May 14, 2025
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Enhancing Magnetic Resonance Imaging (MRI) Report Comprehension in Spinal Trauma: Readability Analysis of AI-Generated Explanations for Thoracolumbar Fractures

Sing DC, Shah KS, Pompliano M, Yi PH, Velluto C, Bagheri A, Eastlack RK, Stephan S, Mundis GM

Enhancing Magnetic Resonance Imaging (MRI) Report Comprehension in Spinal Trauma: Readability Analysis of AI-Generated Explanations for Thoracolumbar Fractures

JMIR AI 2025;4:e69654

DOI: 10.2196/69654

PMID: 40611700

PMCID: 12231343

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Enhancing MRI Report Comprehension in Spinal Trauma: An Analysis of AI-Generated Explanations for Thoracolumbar Fractures

  • David C. Sing; 
  • Kishan S. Shah; 
  • Michael Pompliano; 
  • Paul H. Yi; 
  • Calogero Velluto; 
  • Ali Bagheri; 
  • Robert K. Eastlack; 
  • Stephen Stephan; 
  • Gregory M. Mundis

ABSTRACT

Background:

MRI reports are challenging for patients to interpret and may potentially subject patients to unnecessary anxiety. The advent of advanced artificial intelligence (AI) language models such as GPT-4 hold promise in translating complex medical information into layman terms.

Objective:

To evaluate the accuracy, helpfulness, and readability of GPT-4 in explaining MRI reports of patients with thoracic or lumbar fracture.

Methods:

MRI reports of 20 patients presenting with thoracic or lumbar fracture were obtained. GPT-4o was prompted to explain the MRI report in layman’s terms. The generated explanations were then presented to 7 spine surgeons for evaluation. The MRI report text and GPT-4o explanations were then analyzed to grade the readability of the texts using Flesch Readability Score and Flesch-Kincaid Grade Level Scale.

Results:

The layman explanations provided by GPT-4o were found to be helpful by all surgeons in 17 cases, with 6 of 7 surgeons finding the information helpful in 3 cases. The GPT-4o explanations were considered accurate by all surgeons in 11 cases, with 6 surgeons considering the information accurate in 5 cases, and 4 or 5 surgeons considering the information accurate in 4 cases. Review of surgeon feedback on inaccuracies revealed that the radiology reports were often insufficiently detailed. The mean readability score of the MRI reports was significantly lower than the GPT-4o explanations (32.2 +/- 16.3 vs 53.9 +/- 8.1, p<0.001). The mean reading grade level score of the MRI reports trended higher compared to the GPT-4o explanations (11-12th grade vs 10-11th grade level, p=0.11).

Conclusions:

Overall helpfulness and readability ratings for AI-generated summaries of MRI reports were high, with few inaccuracies recorded. This study demonstrates the potential of GPT-4o as a valuable tool for enhancing patient comprehension of MRI report findings.


 Citation

Please cite as:

Sing DC, Shah KS, Pompliano M, Yi PH, Velluto C, Bagheri A, Eastlack RK, Stephan S, Mundis GM

Enhancing Magnetic Resonance Imaging (MRI) Report Comprehension in Spinal Trauma: Readability Analysis of AI-Generated Explanations for Thoracolumbar Fractures

JMIR AI 2025;4:e69654

DOI: 10.2196/69654

PMID: 40611700

PMCID: 12231343

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.