Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Oct 13, 2017
Open Peer Review Period: Oct 13, 2017 - Nov 3, 2017
Date Accepted: Nov 8, 2017
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure

P Tafti A, Badger J, LaRose E, Shirzadi E, Mahnke A, Mayer J, Ye Z, Page D, Peissig P

Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure

JMIR Med Inform 2017;5(4):e51

DOI: 10.2196/medinform.9170

PMID: 29222076

PMCID: 5741828

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure

  • Ahmad P Tafti; 
  • Jonathan Badger; 
  • Eric LaRose; 
  • Ehsan Shirzadi; 
  • Andrea Mahnke; 
  • John Mayer; 
  • Zhan Ye; 
  • David Page; 
  • Peggy Peissig

Background:

The study of adverse drug events (ADEs) is a tenured topic in medical literature. In recent years, increasing numbers of scientific articles and health-related social media posts have been generated and shared daily, albeit with very limited use for ADE study and with little known about the content with respect to ADEs.

Objective:

The aim of this study was to develop a big data analytics strategy that mines the content of scientific articles and health-related Web-based social media to detect and identify ADEs.

Methods:

We analyzed the following two data sources: (1) biomedical articles and (2) health-related social media blog posts. We developed an intelligent and scalable text mining solution on big data infrastructures composed of Apache Spark, natural language processing, and machine learning. This was combined with an Elasticsearch No-SQL distributed database to explore and visualize ADEs.

Results:

The accuracy, precision, recall, and area under receiver operating characteristic of the system were 92.7%, 93.6%, 93.0%, and 0.905, respectively, and showed better results in comparison with traditional approaches in the literature. This work not only detected and classified ADE sentences from big data biomedical literature but also scientifically visualized ADE interactions.

Conclusions:

To the best of our knowledge, this work is the first to investigate a big data machine learning strategy for ADE discovery on massive datasets downloaded from PubMed Central and social media. This contribution illustrates possible capacities in big data biomedical text analysis using advanced computational methods with real-time update from new data published on a daily basis.


 Citation

Please cite as:

P Tafti A, Badger J, LaRose E, Shirzadi E, Mahnke A, Mayer J, Ye Z, Page D, Peissig P

Adverse Drug Event Discovery Using Biomedical Literature: A Big Data Neural Network Adventure

JMIR Med Inform 2017;5(4):e51

DOI: 10.2196/medinform.9170

PMID: 29222076

PMCID: 5741828

Per the author's request the PDF is not available.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.