Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Jul 11, 2022
Date Accepted: Jan 19, 2023
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Utilization of Near-Real-Time Natural Language Processing for Extraction of Abdominal Aortic Aneurysm Diagnosis from Radiology Reports
ABSTRACT
Background:
Management of abdominal aortic aneurysm (AAA) requires imaging surveillance to evaluate aneurysm size serially. Natural language processing (NLP) has been previously developed to identify patients with AAA retrospectively. However, there are no prior reported prospective studies using NLP to identify AAA patients in real-time from radiology reports
Objective:
To develop and validate a rule-based natural language processing (NLP) algorithm for near real-time automatic extraction of AAA diagnosis from radiology reports for prospective case identification.
Methods:
The AAA-NLP algorithm was developed and deployed on the electronic health record big data infrastructure for near real-time processing of radiology reports from 5/1/2019 to 9/30/2020. NLP extracted named entities for AAA case identification and classified subjects as cases and controls. The reference standard to assess algorithm performance was manual review of processed radiology reports by a trained physician and certified cardiologist following standardized criteria. Reviewers were blinded to diagnosis of each subject. The AAA-NLP algorithm was refined in three successive iterations. For each iteration the AAA-NLP algorithm was modified based on performance compared with the reference standard.
Results:
120 radiology reports were randomly selected for each iteration; a total of 360 reports were reviewed. At each iteration, the AAA-NLP algorithm performance improved. The algorithm identified AAA cases in near real-time with high positive predictive value (98 %), sensitivity (95 %), specificity (98 %), F1 score (97 %), and accuracy (97 %).
Conclusions:
Implementation of NLP for identification of AAA cases from radiology reports in near real-time with high performance is feasible. This prospective real-time NLP technique will generate automated input for patient care, quality projects, and clinical decision support tools for the management of AAA patients.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.