Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Feb 8, 2023
Date Accepted: Mar 24, 2023

The final, peer-reviewed published version of this preprint can be found here:

Deep Learning Approach for Negation and Speculation Detection for Automated Important Finding Flagging and Extraction in Radiology Report: Internal Validation and Technique Comparison Study

Weng KH, Liu CF, Chen CJ

Deep Learning Approach for Negation and Speculation Detection for Automated Important Finding Flagging and Extraction in Radiology Report: Internal Validation and Technique Comparison Study

JMIR Med Inform 2023;11:e46348

DOI: 10.2196/46348

PMID: 37097731

PMCID: 10170361

Deep Learning Approach for Negation and Speculation Detection for Automated Important Finding Flagging and Extraction in Radiology Report: An Internal Validation and Technique Comparison Study

  • Kung-Hsun Weng; 
  • Chung-Feng Liu; 
  • Chia-Jung Chen

ABSTRACT

Background:

Negation and the speculation unrelated to abnormal findings can lead to false positive alarms for automatic radiology report highlighting or flagging by laboratory information systems.

Objective:

This internal validation study evaluates the performance of NLP methods (NegEx, NegBio, NegBERT, and Transformers).

Methods:

We annotated all negative and the speculative statements unrelated to abnormal findings in reports. In Experiment 1, we fine-tuned several Transformer models (ALBERT, BERT, DeBERTa, DistilBERT, ELECTRA, ERNIE, RoBERTa, SpanBERT, XLNet) and compared their performance using precision, recall, accuracy, and F1 scores. In Experiment 2, we compared the best model from Experiment 1 with three established negation and speculation detection algorithms (NegEx, NegBio, NegBERT).

Results:

Our study collected 6000 radiology reports from three branches of Chi Mei Hospital, covering multiple imaging modalities and body parts. 15.0% of words and 39.5% of important diagnostic keywords occurred in negative statements or speculative statements unrelated to abnormal findings. In experiment 1, all models achieved accuracy > 98% and F1 score > 90% on the test dataset. ALBERT showed the best performance (accuracy 99.1%, F1 score 95.8%). In experiment 2, ALBERT outperformed the optimized NegEx, NegBio, and NegBERT methods overall (accuracy 99.6%, F1 score 99.1%) and in the prediction of whether diagnostic keywords occur in speculative statements unrelated to abnormal findings.

Conclusions:

The ALBERT deep learning method showed the best performance. Our result represents a significant advance in the clinical application of computer-aided notification systems. Clinical Trial: Not applicable


 Citation

Please cite as:

Weng KH, Liu CF, Chen CJ

Deep Learning Approach for Negation and Speculation Detection for Automated Important Finding Flagging and Extraction in Radiology Report: Internal Validation and Technique Comparison Study

JMIR Med Inform 2023;11:e46348

DOI: 10.2196/46348

PMID: 37097731

PMCID: 10170361

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.