JMIR Preprints #45849: Development of a Corpus Annotated with Mentions of Pain in Mental Health Records: A Natural Language Processing Approach

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Development of a Corpus Annotated with Mentions of Pain in Mental Health Records: A Natural Language Processing Approach

Jaya Chaturvedi;
Natalia Chance;
Luwaiza Mirza;
Veshalee Vernugopan;
Sumithra Velupillai;
Robert Stewart;
Angus Roberts

ABSTRACT

Pain is a widespread issue, with 20% of adults suffering globally. A strong association has been demonstrated between pain and mental health conditions, and this association is known to exacerbate disability and impairment. Pain is also also known to be strongly related to emotions, which can lead to damaging consequences. As pain is a common reason for people to access healthcare facilities, electronic health records (EHRs) are a potential source of information on this pain. Mental health EHRs could be particularly beneficial since they can show the overlap of pain with mental health. Most mental health EHRs contain the majority of their information within the free-text sections of the records. However, it is challenging to extract information from free-text. Natural language processing (NLP) methods are therefore required to extract this information from the text. This research describes the development of a corpus of manually labelled mentions of pain and pain-related entities from the documents of a mental health EHR database, for use in the development and evaluation of future NLP methods. The EHR database used, CRIS (Clinical Record Interactive Search), consists of anonymised patient records from The South London and Maudsley (SLaM) NHS Foundation Trust in the UK. The corpus was developed through a process of manual annotation where pain mentions were marked as relevant (i.e., referring to physical pain afflicting the patient), negated (i.e., indicating absence of pain) or not-relevant (i.e. referring to pain affecting someone other than the patient, or metaphorical and hypothetical mentions). Relevant mentions were also annotated with additional attributes such as anatomical location affected by pain, pain character, and pain management measures, if mentioned. Over 70% of the mentions found within the documents were annotated as relevant, and about half of these mentions also included the anatomical location affected by the pain. In future work, the extracted information will be used to develop and evaluate a machine learning based NLP application to automatically extract relevant pain information from EHR databases.

Citation

Please cite as:

Chaturvedi J, Chance N, Mirza L, Vernugopan V, Velupillai S, Stewart R, Roberts A

Development of a Corpus Annotated With Mentions of Pain in Mental Health Records: Natural Language Processing Approach

JMIR Form Res 2023;7:e45849

DOI: 10.2196/45849

PMID: 37358897

PMCID: 10337440

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Formative Research

Date Submitted: Jan 19, 2023

Date Accepted: Apr 6, 2023

Development of a Corpus Annotated with Mentions of Pain in Mental Health Records: A Natural Language Processing Approach

ABSTRACT

Citation

Copyright