JMIR Preprints #38178: One clinician is all you need: Data-Efficient NLP Measurement Extraction from Cardiac MRI Reports

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

One clinician is all you need: Data-Efficient NLP Measurement Extraction from Cardiac MRI Reports

Pulkit Singh;
Julian Haimovich;
Christopher Reeder;
Shaan Khurshid;
Emily S. Lau;
Jonathan W Cunningham;
Anthony Philippakis;
Chris D Anderson;
Jennifer E. Ho;
Steven A. Lubitz;
Puneet Batra

ABSTRACT

Background:

Introduction Cardiac MRI (CMR) is a powerful imaging modality that provides detailed quantitative assessment of cardiac anatomy and function. Automated extraction of CMR measurements from clinical reports that are typically stored as unstructured text in electronic health record (EHR) systems would facilitate their use in research. Existing machine learning approaches either rely on large quantities of expert annotation, or require the development of engineered rules that are time-consuming and are specific to the setting in which they were developed.

Objective:

We hypothesize that the use of pre-trained transformer-based language models may enable label-efficient numerical extraction from clinical text without the need for heuristics or large quantities of expert annotations. Here we fine-tune pre-trained transformer-based language models on a small quantity of CMR annotations to extract 21 CMR measurements. We assessed the effect of clinical pre-training to reduce labeling needs and explored alternative representations of numerical inputs to improve performance.

Methods:

Our study sample comprised 99,252 patients that received longitudinal cardiology care in a multi-institutional healthcare system. There were 12,720 available CMR reports from 9,280 patients. We adapted PRAnCER, an annotation tool for clinical text, to collect annotations from a study clinician on 370 reports. We experimented with five different representations of numerical quantities and several model weight initializations. We evaluated extraction performance using macro-averaged F1 scores across the measurements of interest. We applied the best performing model to extract measurements from the remaining CMR reports in the study sample, and evaluated established associations between selected extracted measures with clinical outcomes to demonstrate validity.

Results:

All combinations of weight initializations and numerical representations obtained excellent performance on the gold-standard test set, suggesting that transformer models fine-tuned on a small set of annotations can effectively extract numerical quantities. Our results further indicate that custom numerical representations did not appear to have a significant impact on extraction performance. The best performing model achieved a macro-averaged F1 score of 0.957 across the evaluated CMR measurements (range 0.92 for lowest performing measure of left atrial anterior-posterior dimension to 1.0 for highest performing measures of left ventricular end systolic volume index and left ventricular end systolic diameter). Application of the best performing model to the study cohort yielded 136,407 measurements from all available reports in the study sample. We observed expected associations between extracted left ventricular mass index, left ventricular ejection fraction, and right ventricular ejection fraction with clinical outcomes like atrial fibrillation, heart failure, and mortality.

Conclusions:

This study demonstrated that a domain-agnostic pre-trained transformer model is able to effectively extract quantitative clinical measurements from diagnostic reports with a relatively small number of gold-standard annotations. The proposed workflow may serve as a roadmap for other quantitative entity extraction.

Citation

Please cite as:

Singh P, Haimovich J, Reeder C, Khurshid S, Lau ES, Cunningham JW, Philippakis A, Anderson CD, Ho JE, Lubitz SA, Batra P

One Clinician Is All You Need–Cardiac Magnetic Resonance Imaging Measurement Extraction: Deep Learning Algorithm Development

JMIR Med Inform 2022;10(9):e38178

DOI: 10.2196/38178

PMID: 35960155

PMCID: 9526125

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Mar 21, 2022

Date Accepted: Aug 11, 2022

Date Submitted to PubMed: Aug 12, 2022

One clinician is all you need: Data-Efficient NLP Measurement Extraction from Cardiac MRI Reports

ABSTRACT

Citation

Copyright