Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Mar 9, 2021
Date Accepted: Nov 14, 2021

The final, peer-reviewed published version of this preprint can be found here:

Text Mining of Adverse Events in Clinical Trials: Deep Learning Approach

Chopard D, Treder MS, Corcoran P, Ahmed N, Johnson C, Busse M, Spasic I

Text Mining of Adverse Events in Clinical Trials: Deep Learning Approach

JMIR Med Inform 2021;9(12):e28632

DOI: 10.2196/28632

PMID: 34951601

PMCID: 8742206

Text mining of adverse events in clinical trials: Deep learning approach

  • Daphne Chopard; 
  • Matthias S. Treder; 
  • Padraig Corcoran; 
  • Nagheen Ahmed; 
  • Claire Johnson; 
  • Monica Busse; 
  • Irena Spasic

ABSTRACT

Background:

Pharmacovigilance and safety reporting, which involves processes for monitoring the use of medicines in clinical trials, plays a critical role in the identification of previously unrecognized adverse events or changes in the patterns of adverse events.

Objective:

This study aimed to demonstrate feasibility of automating the coding of adverse events described in the narrative section of the serious adverse event report forms to enable a statistical analysis of the aforementioned patterns.

Methods:

We used the Unified Medical Language System (UMLS) as the coding scheme, which integrates 217 source vocabularies, thus enabling coding against other relevant terminologies such as ICD-10, MedDRA and SNOMED. We used MetaMap, highly configurable dictionary lookup software, to identify mentions of the UMLS concepts. We trained a binary classifier using Bidirectional Encoder Representations from Transformer (BERT), a transformer-based language model that captures contextual relationships, to differentiate between mentions of the UMLS concepts that represent adverse events and those that do not.

Results:

The model achieved a high F1 score of 0.8741 despite the class imbalance.

Conclusions:

These results confirmed that automated coding of adverse events described in the narrative section of the serious adverse event reports is feasible.


 Citation

Please cite as:

Chopard D, Treder MS, Corcoran P, Ahmed N, Johnson C, Busse M, Spasic I

Text Mining of Adverse Events in Clinical Trials: Deep Learning Approach

JMIR Med Inform 2021;9(12):e28632

DOI: 10.2196/28632

PMID: 34951601

PMCID: 8742206

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.