Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Apr 15, 2018
Open Peer Review Period: Apr 17, 2018 - Jun 12, 2018
Date Accepted: Nov 1, 2018
(closed for review but you can still tweet)
Detection of Bleeding Events in Electronic Health Record Notes Using Convolutional Neural Network Models Enhanced With Recurrent Neural Network Autoencoders
ABSTRACT
Background:
Bleeding events are common and critical which may cause significant morbidity and mortality. Studies show that high incidences of bleeding events are associated with cardiovascular disease (CVD) patients on anticoagulant therapy. Prompt and accurate detection of bleeding events are essential for preventing serious consequences. As bleeding events are often described in clinical notes, automatic detection of bleeding events from Electronic Health Record (EHR) narratives has the potential to improve drug safety surveillance and pharmacovigilance.
Objective:
We developed a natural language processing (NLP) system to automatically classify whether an EHR note sentence contains a bleeding event.
Methods:
We expert-annotated 878 EHR notes (76,577 sentences and 562,630 word tokens) for identifying bleeding events at the sentence-level. This annotated corpus was then used to train and validate our NLP systems. We developed an innovative hybrid CNN and LSTM Autoencoder model (HCLA), which integrates a convolutional neural network architecture (CNN) with a bidirectional Long-short term memory (BiLSTM) autoencoder model to leverage large unlabeled EHR data.
Results:
HCLA achieved the best AUC-ROC (0.957) and F1 (0.938) scores for identifying whether a sentence contains a bleeding event, surpassing the strong baseline SVM and other CNN and autoencoder models.
Conclusions:
By incorporating a supervised CNN model with a pre-trained unsupervised BiLSTM Autoencoder, HCLA achieved a high performance in detecting bleeding events.
Citation
Per the author's request the PDF is not available.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.