JMIR Preprints #24678: DrugEx: An End-to-End System to Extract Drug Names and Associated Attributes from Clinical Free-Text

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

DrugEx: An End-to-End System to Extract Drug Names and Associated Attributes from Clinical Free-Text

Ghada Alfattni;
Maksim Belousov;
Niels Peek;
Goran Nenadic

ABSTRACT

Background:

As drug prescriptions are often recorded in free-text clinical narratives, extracting such information is important to support complex health-related tasks. Several natural language processing (NLP) methods have been proposed to extract such information, but still with limited performance.

Objective:

This paper describes (DrugEx), which extracts drugs and their attributes from clinical free-text notes. The study aims to evaluate the feasibility of using NLP and deep learning approaches for extracting and linking drug-associated attributes. It also presents an extensive error analysis of different methods. This effort was part of the 2018 National NLP Clinical Challenges (n2c2) Shared Task on Adverse Drug Events and Medication Extraction.

Methods:

The proposed method (DrugEx) consists of a named entity recogniser (NER) to identify drugs and associated attributes, and relation extraction (RE) component to identify relations between them. For the NER, we explored deep learning-based approaches (i.e. Bi-LSTM-CRFs) with various embeddings (i.e. word, character and semantic-feature embeddings) in order to investigate how different embeddings influence the performance. For RE, a rule-based method was implemented and compared with a positional-aware LSTM model. The methods were trained and evaluated using the 2018 n2c2 shared-task data.

Results:

Experiments showed that the best model (Bi-LSTM-CRFs with words and character embeddings) achieved lenient micro F-scores of 0.921 for NER, 0.927 for RE and 0.855 for the end-to-end system.

Conclusions:

The proposed end-to-end system achieves encouraging results and demonstrates the feasibility of using deep learning methods for extracting medication information from free-text data.

Citation

Please cite as:

Alfattni G, Belousov M, Peek N, Nenadic G

Extracting Drug Names and Associated Attributes From Discharge Summaries: Text Mining Study

JMIR Med Inform 2021;9(5):e24678

DOI: 10.2196/24678

PMID: 33949962

PMCID: 8135022

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Sep 30, 2020

Date Accepted: Feb 20, 2021

DrugEx: An End-to-End System to Extract Drug Names and Associated Attributes from Clinical Free-Text

ABSTRACT

Citation

Copyright