Accepted for/Published in: JMIR AI

Date Submitted: May 23, 2025
Date Accepted: Mar 7, 2026

The final, peer-reviewed published version of this preprint can be found here:

Application of Language Models for the Analysis of Adverse Drug Events in Pharmaceutical Research and Development: Scoping Review

Schreier O, Yazdani A, Galdadas I, Kabak R, Gervasio FL, Mu G, Teodoro D

Application of Language Models for the Analysis of Adverse Drug Events in Pharmaceutical Research and Development: Scoping Review

JMIR AI 2026;5:e77732

DOI: 10.2196/77732

PMID: 42302306

PMCID: 13271608

Application of language models for the analysis of adverse drug events in pharmaceutical research and development: A scoping review

  • Oren Schreier; 
  • Anthony Yazdani; 
  • Ioannis Galdadas; 
  • Ryme Kabak; 
  • Francesco Luigi Gervasio; 
  • Gang Mu; 
  • Douglas Teodoro

ABSTRACT

Background:

Adverse drug events (ADEs) are a major cause of morbidity and mortality. Recent advances in artificial intelligence (AI), particularly deep learning, have enabled the development of models specifically designed for the prediction and detection of ADEs across all stages of drug development.

Objective:

This scoping review aims to provide a comprehensive overview of how AI methods are applied to predict and detect ADEs throughout the drug development pipeline, from preclinical research to post-market surveillance.

Methods:

We conducted a scoping review in accordance with PRISMA-ScR guidelines. A systematic search of PubMed, Web of Science, and Google Scholar identified 1,802 records published between January 2015 and December 2022. After screening and eligibility assessment, 81 studies were included in the final analysis. Inclusion criteria focused on articles using AI to analyze ADEs. Data extraction covered, among other elements, algorithm type, method type, features, data sources, prediction tasks, evaluation metrics, and application stage.

Results:

Among the 81 included studies, 37 addressed pre-market ADE prediction and 44 focused on post-market detection. Most studies originated from the United States and China. The liver and heart were the most commonly studied organs due to their critical roles in systemic drug response. A shift from traditional methods to deep learning approaches is emerging, with transformers in particular becoming increasingly dominant. Commonly used datasets included SIDER, electronic health record (EHR) data from clinical notes, n2c2 clinical notes, and DrugBank. Text embeddings were the dominant feature representation for detection tasks. Evaluation metrics differed by phase: the area under the receiver operating characteristic curve (AUROC) was prevalent in pre-market prediction (n=32), while the F1-score dominated post-market detection (n=39). Major challenges included the lack of detailed dosage data, limited integration of molecular target and pathway information, and class imbalance in datasets, all of which affect model interpretability and performance assessment.
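To make the point about class imbalance and performance assessment concrete, the following minimal Python sketch computes AUROC and F1-score for the same set of predictions. It is an illustration only, not a method from the reviewed studies: the labels and scores are synthetic, the helper names `auroc` and `f1_score` are hypothetical, and the 0.5 decision threshold is an assumption.

```python
def auroc(y_true, scores):
    """AUROC as the probability that a randomly chosen positive is scored
    above a randomly chosen negative (ties count as 0.5)."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def f1_score(y_true, y_pred):
    """F1 = 2*TP / (2*TP + FP + FN); returns 0.0 when undefined."""
    tp = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(y_true, y_pred) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Synthetic, imbalanced toy data: 2 ADE-positive cases out of 10 (assumption).
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.4, 0.6, 0.3, 0.2, 0.2, 0.1, 0.1, 0.05, 0.05]

# Assumed decision threshold of 0.5 to turn scores into hard labels.
y_pred = [1 if s >= 0.5 else 0 for s in scores]

print("AUROC:", auroc(y_true, scores))   # threshold-free ranking quality
print("F1:   ", f1_score(y_true, y_pred))  # depends on the chosen threshold
```

On this toy example the ranking-based AUROC is high while the thresholded F1-score is much lower, because a single false positive and a single missed case weigh heavily when positives are rare. This is one reason results reported with different metrics are hard to compare across studies.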

Conclusions:

Although still emerging, the application of AI in ADE analysis shows significant promise. Our review highlights that various deep learning approaches have already been successfully implemented. As these technologies continue to evolve, they are expected to enhance drug safety, reduce healthcare costs, and support timely pharmacovigilance. Improvements in data quality, model interpretability, and methodological robustness will be essential to facilitate broader clinical adoption.

Clinical Trial:

Not applicable


© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.