Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Apr 15, 2018
Open Peer Review Period: Apr 17, 2018 - Jun 12, 2018
Date Accepted: Nov 1, 2018
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Detection of Bleeding Events in Electronic Health Record Notes Using Convolutional Neural Network Models Enhanced With Recurrent Neural Network Autoencoders: Deep Learning Approach

LI R, HU B, LIU F, LIU W, Cunningham F, McManus DD, YU H

Detection of Bleeding Events in Electronic Health Record Notes Using Convolutional Neural Network Models Enhanced With Recurrent Neural Network Autoencoders: Deep Learning Approach

JMIR Med Inform 2019;7(1):e10788

DOI: 10.2196/10788

PMID: 30735140

PMCID: 6384542

Detection of Bleeding Events in Electronic Health Record Notes Using Convolutional Neural Network Models Enhanced With Recurrent Neural Network Autoencoders

  • Rumeng LI; 
  • Baotian HU; 
  • Feifan LIU; 
  • Weisong LIU; 
  • Fran Cunningham; 
  • David D McManus; 
  • Hong YU

ABSTRACT

Background:

Bleeding events are common and critical which may cause significant morbidity and mortality. Studies show that high incidences of bleeding events are associated with cardiovascular disease (CVD) patients on anticoagulant therapy. Prompt and accurate detection of bleeding events are essential for preventing serious consequences. As bleeding events are often described in clinical notes, automatic detection of bleeding events from Electronic Health Record (EHR) narratives has the potential to improve drug safety surveillance and pharmacovigilance.

Objective:

We developed a natural language processing (NLP) system to automatically classify whether an EHR note sentence contains a bleeding event.

Methods:

We expert-annotated 878 EHR notes (76,577 sentences and 562,630 word tokens) for identifying bleeding events at the sentence-level. This annotated corpus was then used to train and validate our NLP systems. We developed an innovative hybrid CNN and LSTM Autoencoder model (HCLA), which integrates a convolutional neural network architecture (CNN) with a bidirectional Long-short term memory (BiLSTM) autoencoder model to leverage large unlabeled EHR data.

Results:

HCLA achieved the best AUC-ROC (0.957) and F1 (0.938) scores for identifying whether a sentence contains a bleeding event, surpassing the strong baseline SVM and other CNN and autoencoder models.

Conclusions:

By incorporating a supervised CNN model with a pre-trained unsupervised BiLSTM Autoencoder, HCLA achieved a high performance in detecting bleeding events.


 Citation

Please cite as:

LI R, HU B, LIU F, LIU W, Cunningham F, McManus DD, YU H

Detection of Bleeding Events in Electronic Health Record Notes Using Convolutional Neural Network Models Enhanced With Recurrent Neural Network Autoencoders: Deep Learning Approach

JMIR Med Inform 2019;7(1):e10788

DOI: 10.2196/10788

PMID: 30735140

PMCID: 6384542

Per the author's request the PDF is not available.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.