JMIR Preprints #11448: Protocol for Evaluating the Extraction of Adverse Drug Reactions Information in Social Media, the ADR-PRISM Project

Current Preprint Settings

(as selected by the authors)

1. Allow access to the preprint PDF upon submission to:

(a) Open peer-review purposes
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) Nobody

2. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) Nobody

3. When a final paper is published in a JMIR journal, display the preprint as follows:

(a) Allow download
(b) Show abstract only
(c) Do not display anything

4. If the paper is rejected from JMIR journals, display the preprint to:

(a) Logged-in users only
(b) Anybody, anytime
(c) Nobody

Protocol for Evaluating the Extraction of Adverse Drug Reactions Information in Social Media, the ADR-PRISM Project

Armelle Guenegou-Arnoux;
Yannick Girardeau;
Xiaoyi Chen;
Myrtille Deldossi;
Rim Aboukhamis;
Carole Faviez;
Badisse Dahamna;
Pierre Karapetiantz;
Sylvie Guillemin-Lanne;
Nathalie Texier;
Anita Burgun;
Sandrine Katsahian

ABSTRACT

Background:

Social media is today seriously emerging as a source of information on post-marketing pharmacovigilance (PV), i.e. adverse events occurring with marketed drugs. Social media grew in importance in the very last years but still remains largely unexploited nowadays. Quite a few researchers published their work on either drug names or adverse effects (AE) recognition or drug adverse reactions (ADR) information extraction. However, a Gold Standard, consisting in manual annotations of the ADR by human experts from the corpus extracted from social media, was not systematically implemented and its quality is not always assessed on its own. When existing, the sample size is arbitrary only and doesnâ€™t rely on statistical arguments. This questions the statistical reliability of the results. At last, some of the extraction methods just take into account any co-occurrence of a drug entity with one or several AE entitie(s) which leads to extract general information such as the indication of the drug in addition to regular ADR.

Objective:

In our work, we propose a standardized protocol for the evaluation of a software extracting information purely on ADRs.

Methods:

Messages from French health forums are extracted (evaluation dataset). Drug and AE entities recognition are based on lexicons: Racine Pharma thesaurus and MedDRA terminology respectively. NLP-based techniques automate the ADR information extraction (relation between the Drug and AE entities). Several concepts of ADRs will be chosen. The corpus for evaluation is a random sample of the messages containing these concepts from the evaluation dataset. Two persons experienced in medical terminology will manually annotate the corpus (Gold Standard). The study outcomes will be the precision and recall and their confidence intervals at 95%. For each concept, these outcomes will be computed against the Gold Standard. Necessary and sufficient sample size will be calculated to ensure statistical confidence in the assessed results. Gold Standard in its-self will be evaluated through Kappa inter-annotators agreements. Further analyses will enable to explore the granularity in the terminologies.

Results:

The automated ADR information extraction in the corpus for evaluation is already finished; the ADRsâ€™ concepts as well as the sample for the Gold Standard are selected. The Gold Standard is completed, we currently proceed to the data analyses and the study results are expected in 2018.

Conclusions:

This protocol is the first one that used standardized statistical methods to create the corpus for evaluation of a NLP tool, thus ensuring necessary statistical power of the assessed results. Once completed, our ADR information extraction software should interest different actors of our society (Health agencies, pharmaceutical companies, general public).

Citation

Please cite as:

Guenegou-Arnoux A, Girardeau Y, Chen X, Deldossi M, Aboukhamis R, Faviez C, Dahamna B, Karapetiantz P, Guillemin-Lanne S, Texier N, Burgun A, Katsahian S

The Adverse Drug Reactions From Patient Reports in Social Media Project: Protocol for an Evaluation Against a Gold Standard

JMIR Res Protoc 2019;8(5):e11448

DOI: 10.2196/11448

PMID: 31066711

PMCID: 6528435

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Research Protocols

Date Submitted: Jun 29, 2018

Open Peer Review Period: Jul 3, 2018 - Aug 1, 2018

Date Accepted: Dec 21, 2018

(closed for review but you can still tweet)

Protocol for Evaluating the Extraction of Adverse Drug Reactions Information in Social Media, the ADR-PRISM Project

ABSTRACT

Citation

Copyright

JMIR Preprints

Accepted for/Published in: JMIR Research Protocols

Date Submitted: Jun 29, 2018

Open Peer Review Period: Jul 3, 2018 - Aug 1, 2018

Date Accepted: Dec 21, 2018

(closed for review but you can still tweet)

Protocol for Evaluating the Extraction of Adverse Drug Reactions Information in Social Media, the ADR-PRISM Project

ABSTRACT

Citation

Per the author's request the PDF is not available.

Copyright