Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Mar 21, 2022
Date Accepted: Aug 11, 2022
Date Submitted to PubMed: Aug 12, 2022

The final, peer-reviewed published version of this preprint can be found here:

One Clinician Is All You Need–Cardiac Magnetic Resonance Imaging Measurement Extraction: Deep Learning Algorithm Development

Singh P, Haimovich J, Reeder C, Khurshid S, Lau ES, Cunningham JW, Philippakis A, Anderson CD, Ho JE, Lubitz SA, Batra P

JMIR Med Inform 2022;10(9):e38178

DOI: 10.2196/38178

PMID: 35960155

PMCID: 9526125

One clinician is all you need: Data-Efficient NLP Measurement Extraction from Cardiac MRI Reports

  • Pulkit Singh
  • Julian Haimovich
  • Christopher Reeder
  • Shaan Khurshid
  • Emily S. Lau
  • Jonathan W. Cunningham
  • Anthony Philippakis
  • Chris D. Anderson
  • Jennifer E. Ho
  • Steven A. Lubitz
  • Puneet Batra

ABSTRACT

Background:

Cardiac MRI (CMR) is a powerful imaging modality that provides detailed quantitative assessment of cardiac anatomy and function. Automated extraction of CMR measurements from clinical reports, which are typically stored as unstructured text in electronic health record (EHR) systems, would facilitate their use in research. Existing machine learning approaches either rely on large quantities of expert annotation or require the development of engineered rules that are time-consuming to build and specific to the setting in which they were developed.

Objective:

We hypothesized that pre-trained transformer-based language models may enable label-efficient numerical extraction from clinical text without the need for heuristics or large quantities of expert annotations. We fine-tuned pre-trained transformer-based language models on a small quantity of CMR annotations to extract 21 CMR measurements. We also assessed whether clinical pre-training reduces labeling needs and explored alternative representations of numerical inputs to improve performance.

Methods:

Our study sample comprised 99,252 patients who received longitudinal cardiology care in a multi-institutional healthcare system. There were 12,720 available CMR reports from 9,280 patients. We adapted PRAnCER, an annotation tool for clinical text, to collect annotations from a study clinician on 370 reports. We experimented with five different representations of numerical quantities and several model weight initializations. We evaluated extraction performance using macro-averaged F1 scores across the measurements of interest. We applied the best-performing model to extract measurements from the remaining CMR reports in the study sample, and evaluated established associations between selected extracted measures and clinical outcomes to demonstrate validity.
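For readers unfamiliar with the metric, the following is a minimal sketch of macro-averaged F1, not the authors' evaluation code. It assumes that per-measurement true-positive, false-positive, and false-negative counts are already available; the `Counts` class, the function names, and the example counts are hypothetical and used only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Counts:
    """Hypothetical per-measurement confusion counts."""
    tp: int  # extracted values that match the gold annotation
    fp: int  # extracted values with no matching gold annotation
    fn: int  # gold annotations the model failed to extract


def f1(c: Counts) -> float:
    """F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when there are no counts."""
    denom = 2 * c.tp + c.fp + c.fn
    return 2 * c.tp / denom if denom else 0.0


def macro_f1(per_measure: dict[str, Counts]) -> float:
    """Unweighted mean of per-measurement F1, so each measurement counts equally."""
    scores = [f1(c) for c in per_measure.values()]
    return sum(scores) / len(scores)


# Illustrative (made-up) counts for two measurement types.
example = {
    "measure_a": Counts(tp=1, fp=0, fn=0),
    "measure_b": Counts(tp=1, fp=1, fn=1),
}
print(macro_f1(example))
```

Because the mean is unweighted, a rarely reported measurement influences the score as much as a frequently reported one, which is why the per-measurement range is reported alongside the macro average.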

Results:

All combinations of weight initializations and numerical representations obtained excellent performance on the gold-standard test set, suggesting that transformer models fine-tuned on a small set of annotations can effectively extract numerical quantities. Our results further indicate that custom numerical representations did not appear to have a significant impact on extraction performance. The best-performing model achieved a macro-averaged F1 score of 0.957 across the evaluated CMR measurements (range: 0.92 for the lowest-performing measure, left atrial anterior-posterior dimension, to 1.0 for the highest-performing measures, left ventricular end systolic volume index and left ventricular end systolic diameter). Application of the best-performing model to the study cohort yielded 136,407 measurements from all available reports in the study sample. We observed expected associations of extracted left ventricular mass index, left ventricular ejection fraction, and right ventricular ejection fraction with clinical outcomes such as atrial fibrillation, heart failure, and mortality.

Conclusions:

This study demonstrated that a domain-agnostic pre-trained transformer model is able to effectively extract quantitative clinical measurements from diagnostic reports with a relatively small number of gold-standard annotations. The proposed workflow may serve as a roadmap for other quantitative entity extraction.




© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.