Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Research Protocols

Date Submitted: Apr 29, 2025
Date Accepted: Aug 12, 2025
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Identifying Firearm Violence Exposure in Primary Care Clinical Notes: Protocol for Developing a National Language Processing Text Classifier

Carwright N, Biel FM, Hoopes M, Bataineh AA, Rivera P, Bet K, Cook N

Identifying Firearm Violence Exposure in Primary Care Clinical Notes: Protocol for Developing a National Language Processing Text Classifier

JMIR Res Protoc 2025;14:e76681

DOI: 10.2196/76681

PMID: 40910696

PMCID: 12449666

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Identifying Firearm Violence Exposure in Primary Care Clinical Notes: Protocol for Developing a National Language Processing Text Classifier

  • Natalie Carwright; 
  • Frances. M. Biel; 
  • Megan Hoopes; 
  • Ali Al Bataineh; 
  • Pedro Rivera; 
  • Kerry Bet; 
  • Nicole Cook

ABSTRACT

Background:

Firearm violence injury is captured via structured data codes that best reflect acute bodily injury. There are no structured data codes to describe secondary exposure (e.g. witnessing a shooting, being threatened by a firearm, or losing a loved one to gun violence and injury from firearms) even though such exposure is associated with many long and short term health impacts. Clinical chart notes from Electronic Health Records (EHRs) often contain data not otherwise captured in structured data fields and can be categorized using natural language processing (NLP).

Objective:

The study protocol described here outlines the steps being taken to develop an NLP text classifier for determination of exposure to firearm violence from ambulatory primary care and behavioral health EHR clinical progress notes for persons aged ≥ 5 years.

Methods:

describe the process for arriving at a novel NLP lexicon, clinical process note selection, the steps for text classifier training and selection, and evaluation of model performance. We also describe involvement of a stakeholder advisory committee in the development of the lexicon, and how the lexicon addresses biases inherent in NLP text classifiers.

Results:

We describe the development of an NLP lexicon with the input of a stakeholder advisory committee, evaluation of the text classifier, and future plans for utilization of the text classifier.

Conclusions:

This work describes the development of a novel NLP text classifier to identify exposure to firearm violence in ambulatory primary care and behavioral health clinical progress notes. Clinical Trial: This is an IRB exempt non-intervention data study and was not registered on clinicaltrials.gov


 Citation

Please cite as:

Carwright N, Biel FM, Hoopes M, Bataineh AA, Rivera P, Bet K, Cook N

Identifying Firearm Violence Exposure in Primary Care Clinical Notes: Protocol for Developing a National Language Processing Text Classifier

JMIR Res Protoc 2025;14:e76681

DOI: 10.2196/76681

PMID: 40910696

PMCID: 12449666

Per the author's request the PDF is not available.