Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Sep 30, 2020
Date Accepted: Feb 20, 2021
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
DrugEx: An End-to-End System to Extract Drug Names and Associated Attributes from Clinical Free-Text
ABSTRACT
Background:
As drug prescriptions are often recorded in free-text clinical narratives, extracting such information is important to support complex health-related tasks. Several natural language processing (NLP) methods have been proposed to extract such information, but still with limited performance.
Objective:
This paper describes (DrugEx), which extracts drugs and their attributes from clinical free-text notes. The study aims to evaluate the feasibility of using NLP and deep learning approaches for extracting and linking drug-associated attributes. It also presents an extensive error analysis of different methods. This effort was part of the 2018 National NLP Clinical Challenges (n2c2) Shared Task on Adverse Drug Events and Medication Extraction.
Methods:
The proposed method (DrugEx) consists of a named entity recogniser (NER) to identify drugs and associated attributes, and relation extraction (RE) component to identify relations between them. For the NER, we explored deep learning-based approaches (i.e. Bi-LSTM-CRFs) with various embeddings (i.e. word, character and semantic-feature embeddings) in order to investigate how different embeddings influence the performance. For RE, a rule-based method was implemented and compared with a positional-aware LSTM model. The methods were trained and evaluated using the 2018 n2c2 shared-task data.
Results:
Experiments showed that the best model (Bi-LSTM-CRFs with words and character embeddings) achieved lenient micro F-scores of 0.921 for NER, 0.927 for RE and 0.855 for the end-to-end system.
Conclusions:
The proposed end-to-end system achieves encouraging results and demonstrates the feasibility of using deep learning methods for extracting medication information from free-text data.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.