Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Currently submitted to: JMIR Research Protocols

Date Submitted: Apr 1, 2026
Open Peer Review Period: Apr 2, 2026 - May 28, 2026
(currently open for review)

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Extracting Pain Severity and Functional Interference from Clinical Narratives Using Domain-Informed Large Language Models: Study Protocol

  • Xiaoyi Zhang; 
  • Chris Wilson; 
  • Hannah Eyre; 
  • David E. Reed II; 
  • Alexander Kloehn; 
  • Ethan Wan Rosser; 
  • Gang Luo; 
  • Steven Bacchus Zeliadt

ABSTRACT

Background:

Chronic pain is a leading cause of disability and requires multidimensional assessment of pain intensity and functioning, yet electronic health records (EHRs) rarely capture these measures systematically. By contrast, surveys collecting patient-reported outcomes can assess pain over multiple dimensions but remain resource-intensive and difficult to scale for continuous population-level monitoring.

Objective:

The objective of this study is to develop and validate a domain-informed natural language processing (NLP) framework to derive pain severity and functional interference outcomes from unstructured clinical narratives. Using a benchmark dataset of 3,726 Veterans with longitudinal survey measures, we aim to demonstrate that NLP-derived outcomes can serve as a reliable, scalable surrogate for resource-intensive patient-reported surveys.

Methods:

This study utilizes a retrospective cohort of 3,726 Veterans with chronic musculoskeletal pain initiating Complementary and Integrative Health (CIH) therapies across 18 Veterans Health Administration (VA) Whole Health Flagship sites (2021–2023). The dataset encompasses longitudinal patient-reported outcome surveys serving as the benchmark, linked with both structured and unstructured data from the VA EHR. To guide extraction and enable scalable processing, we developed a seed lexicon and annotation guidelines based on established psychometric instruments and input from subject matter experts (SMEs). Currently, SMEs are annotating clinical notes in iterative batches. Upon completion, a large language model (LLM) will annotate additional notes. These annotations will be used to fine-tune a lightweight language model capable of processing the entire cohort. The study employs a three-stage validation process, assessing: (1) documentation completeness; (2) inference accuracy, evaluating agreement between model outputs and SME annotations at both the LLM and fine-tuned model levels; and (3) concordance with patient-reported outcomes captured independently from the EHR.

Results:

To date, a cohort of 3,726 Veterans has been identified. The longitudinal survey data have been linked to clinical notes corresponding to the survey time period. The seed lexicon and annotation guidelines have been developed. Annotation is underway in iterative batches to drive the subsequent LLM adaptation and three-stage validation.

Conclusions:

This protocol outlines a framework for identifying severe pain interference from clinical narratives, addressing a critical gap in health care system surveillance. To our knowledge, this is the first study to validate clinical text-based pain outcome extraction against patient-reported outcomes in a nationwide longitudinal cohort. If successful, this approach will enable health care systems to continuously monitor pain-related functional interference and support more holistic, patient-centered pain management at scale.


 Citation

Please cite as:

Zhang X, Wilson C, Eyre H, Reed DE II, Kloehn A, Rosser EW, Luo G, Zeliadt SB

Extracting Pain Severity and Functional Interference from Clinical Narratives Using Domain-Informed Large Language Models: Study Protocol

JMIR Preprints. 01/04/2026:96346

DOI: 10.2196/preprints.96346

URL: https://preprints.jmir.org/preprint/96346

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.