JMIR Preprints #45289: Bi-LSTM based detection of ADR posts using Korean SNS data

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Bi-LSTM based detection of ADR posts using Korean SNS data

Chung-Chun Lee;
Seunghee Lee;
Mi-Hwa Song;
Jong-Yeup Kim;
Suehyun Lee

ABSTRACT

Background:

Social networking services (SNS) closely reflect the lives of individuals in modern society and generate large amounts of data. Previous studies have extracted drug information using relevant SNS data. In particular, it is important to detect adverse drug reactions (ADRs) early using drug surveillance systems. To this end, various deep learning methods have been employed, which have studied multiple languages in addition to English.

Objective:

A cautionary drug that can cause ADRs in elderly patients was selected, and Korean SNS data containing this drug information were collected. Based on this information, we aimed to develop a deep learning model that classifies drug ADR posts based on a Recurrent Neural Network (RNN).

Methods:

In previous studies, ketoprofen, which has a high prescription frequency and thus, was selected as the target drug. Posts were collected from portal site containing information about the drug, and NLP techniques for data written in Korean were applied. Posts containing relevant drug names and ADR word pairs were filtered through association analysis, and training data were generated through manual labeling tasks. So, a Bi-LSTM classification model was generated. The entire process was further verified using aceclofenac.

Results:

Among the NSAIDs, Korean SNS posts containing Ketoprofen and Aceclofenac information were secured, and the generic name lexicon, ADR lexicon, and Korean stop word lexicon were generated. In addition, to improve the accuracy of the classification model, an embedding layer was created considering the association between the drug name and the ADR word. In the ADR post-classification test, ketoprofen and aceclofenac achieved 87% and 80% accuracy, respectively.

Conclusions:

Here, we propose a process for developing a model for classifying ADR posts using SNS data. After analyzing drug name-ADR patterns, we filtered high-quality data by extracted posts, including known ADR words based on the analysis. Based on these data, we developed a model that classifies ADR posts. This confirmed that a model that can leverage social data to monitor ADRs automatically is feasible.

Citation

Please cite as:

Lee CC, Lee S, Song MH, Kim JY, Lee S

Bidirectional Long Short-Term Memory–Based Detection of Adverse Drug Reaction Posts Using Korean Social Networking Services Data: Deep Learning Approaches

JMIR Med Inform 2024;12:e45289

DOI: 10.2196/45289

PMID: 39565685

PMCID: 11601139

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Dec 23, 2022

Date Accepted: Aug 6, 2024

Bi-LSTM based detection of ADR posts using Korean SNS data

ABSTRACT

Citation

Copyright