Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jul 16, 2022
Open Peer Review Period: Jul 15, 2022 - Sep 9, 2022
Date Accepted: Dec 19, 2022
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Supervised Relation Extraction Between Suicide-Related Entities and Drugs: Development and Usability Study of an Annotated PubMed Corpus

Karapetian K, Jeon SM, Kwon JW, Suh YK

Supervised Relation Extraction Between Suicide-Related Entities and Drugs: Development and Usability Study of an Annotated PubMed Corpus

J Med Internet Res 2023;25:e41100

DOI: 10.2196/41100

PMID: 36884281

PMCID: 10034613

An Annotated PubMed Corpus to Support Supervised Relation Extraction between Suicide-Related Entities and Drugs

  • Karina Karapetian; 
  • Soo Min Jeon; 
  • Jin-Won Kwon; 
  • Young-Kyoon Suh

ABSTRACT

Background:

Drug-induced suicide has been debated as a crucial issue in clinical and public health research. Published research articles are valuable data resources to find information on drugs associated with suicidal adverse events. It is essential to apply an automated process to extract such information and rapidly detect drugs related to suicide risk. Still, such a process is not well established, and there has also been little dataset to train and validate models to classify drug-induced suicide.

Objective:

This study aims to build a drug-suicide relations (DSR) corpus of annotation of entities for drugs and suicidal events and their relations. To confirm the effectiveness of the DSR corpus, we evaluate the performance of a relation classification model in conjunction with various embeddings applying the corpus.

Methods:

We collect abstracts and titles of research articles associated with drugs and suicide from PubMed. We conduct manual annotation for entities of drug and suicide and their relation (as adverse drug events, treatment, suicide means, and miscellaneous) at the sentence level. To reduce the manual annotation effort, we run the annotation procedure after selecting sentences with pre-trained zero-shot classifier or selecting only sentences with both drug and suicide keywords. We train a relation classification model using various BERT (Bidirectional Encoder Representations from Transformers) embeddings with the proposed corpus. We then evaluate the performance of the model to determine the best BERT-based embedding that best suits our corpus.

Results:

Our corpus comprises 11,894 sentences from titles and abstracts of research articles extracted from PubMed. Each sentence is annotated with (1) drug and suicide entities and (2) the relation between these two entities, including adverse drug events, treatment, means, miscellaneous, and others. All relation classification models that we observe are fine-tuned based on our corpus. The models achieve F1 scores above 0.8 in detecting sentences of suicidal adverse events, regardless of the pre-trained model and dataset properties.

Conclusions:

To the best of our knowledge, the proposed corpus is the first and the most extensive corpus targeting drug-suicide relations.


 Citation

Please cite as:

Karapetian K, Jeon SM, Kwon JW, Suh YK

Supervised Relation Extraction Between Suicide-Related Entities and Drugs: Development and Usability Study of an Annotated PubMed Corpus

J Med Internet Res 2023;25:e41100

DOI: 10.2196/41100

PMID: 36884281

PMCID: 10034613

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.