JMIR Preprints #28632: Text mining of adverse events in clinical trials: Deep learning approach

Text mining of adverse events in clinical trials: Deep learning approach

Daphne Chopard;
Matthias S. Treder;
Padraig Corcoran;
Nagheen Ahmed;
Claire Johnson;
Monica Busse;
Irena Spasic

ABSTRACT

Background:

Pharmacovigilance and safety reporting, which involve processes for monitoring the use of medicines in clinical trials, play a critical role in the identification of previously unrecognized adverse events or changes in the patterns of adverse events.

Objective:

This study aimed to demonstrate the feasibility of automating the coding of adverse events described in the narrative section of serious adverse event report forms, to enable statistical analysis of the aforementioned patterns.

Methods:

As the coding scheme, we used the Unified Medical Language System (UMLS), which integrates 217 source vocabularies and thus enables coding against other relevant terminologies such as ICD-10, MedDRA, and SNOMED. We used MetaMap, a highly configurable dictionary lookup tool, to identify mentions of UMLS concepts. We trained a binary classifier using Bidirectional Encoder Representations from Transformers (BERT), a transformer-based language model that captures contextual relationships, to differentiate between mentions of UMLS concepts that represent adverse events and those that do not.
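
The two-step approach described above (dictionary lookup to propose candidate concept mentions, followed by a binary classifier that keeps only those representing adverse events) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the toy lexicon, the placeholder concept identifiers, and the names `find_mentions`, `code_adverse_events`, and `toy_classifier` are all hypothetical, a plain regular-expression lookup stands in for MetaMap, and a fixed allow-list stands in for the trained BERT classifier.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Mention:
    text: str        # surface form found in the narrative
    concept_id: str  # placeholder identifier, NOT a real UMLS concept ID
    start: int       # character offset where the match begins
    end: int         # character offset where the match ends


# Toy lexicon standing in for the UMLS; the IDs are invented for illustration.
TOY_LEXICON: Dict[str, str] = {
    "chest pain": "TOY0001",
    "pneumonia": "TOY0002",
    "hospital": "TOY0003",
}


def find_mentions(narrative: str, lexicon: Dict[str, str]) -> List[Mention]:
    """Step 1: case-insensitive dictionary lookup (a stand-in for MetaMap)."""
    mentions = []
    for phrase, concept_id in lexicon.items():
        pattern = r"\b" + re.escape(phrase) + r"\b"
        for m in re.finditer(pattern, narrative, flags=re.IGNORECASE):
            mentions.append(Mention(m.group(0), concept_id, m.start(), m.end()))
    # Return mentions in the order they appear in the text.
    return sorted(mentions, key=lambda mention: mention.start)


def code_adverse_events(
    narrative: str,
    lexicon: Dict[str, str],
    is_adverse_event: Callable[[str, Mention], bool],
) -> List[Mention]:
    """Step 2: keep only the mentions that the binary classifier accepts."""
    return [
        mention
        for mention in find_mentions(narrative, lexicon)
        if is_adverse_event(narrative, mention)
    ]


def toy_classifier(narrative: str, mention: Mention) -> bool:
    """Hypothetical stand-in for the trained classifier: a fixed allow-list."""
    return mention.concept_id in {"TOY0001", "TOY0002"}


if __name__ == "__main__":
    report = "Patient was admitted to hospital with chest pain; pneumonia was diagnosed."
    for mention in code_adverse_events(report, TOY_LEXICON, toy_classifier):
        print(mention.concept_id, mention.text)
```

In this sketch the classifier receives the full narrative along with each mention, reflecting the idea that the decision depends on context rather than on the concept alone; in a real system that callable would wrap the trained model.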

Results:

The model achieved a high F1 score of 0.8741 despite the class imbalance.
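
To illustrate why the F1 score is informative under class imbalance, the short sketch below computes F1 and accuracy from confusion-matrix counts. The counts are invented for illustration only and are not taken from the study; the function names are likewise assumptions.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Harmonic mean of precision and recall for the positive class."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def accuracy(tp: int, fp: int, fn: int, tn: int) -> float:
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + fp + fn + tn)


if __name__ == "__main__":
    # Illustrative imbalanced data: 10 positives, 90 negatives.
    # A degenerate model that labels everything negative looks accurate
    # but finds no positives, which F1 exposes.
    print(accuracy(tp=0, fp=0, fn=10, tn=90), f1_score(tp=0, fp=0, fn=10))
    # A model that finds 8 of the 10 positives with 2 false alarms.
    print(accuracy(tp=8, fp=2, fn=2, tn=88), f1_score(tp=8, fp=2, fn=2))
```

Because F1 ignores true negatives, a model cannot score well simply by predicting the majority class.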

Conclusions:

These results confirmed that automated coding of adverse events described in the narrative section of the serious adverse event reports is feasible.

Citation

Please cite as:

Chopard D, Treder MS, Corcoran P, Ahmed N, Johnson C, Busse M, Spasic I

Text Mining of Adverse Events in Clinical Trials: Deep Learning Approach

JMIR Med Inform 2021;9(12):e28632

DOI: 10.2196/28632

PMID: 34951601

PMCID: 8742206

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Mar 9, 2021

Date Accepted: Nov 14, 2021
