JMIR Preprints #65195: Development and Validation of a Rule-based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Development and Validation of a Rule-based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults

Xing Xing Qian;
Pui Hing Chau;
Daniel YT Fong;
Mandy Ho;
Jean Woo

ABSTRACT

Background:

To address fall underestimation by the International Classification of Diseases (ICD) in clinical settings, incorporating information from clinical notes via Natural Language Processing (NLP) has emerged as a solution. However, its application to inpatient notes has not been fully investigated.

Objective:

This study developed and validated a rule-based NLP algorithm to identify falls based on inpatient admission notes from older patients.

Methods:

This retrospective study used 12-year electronic inpatient records of patients aged ≥65 from public hospitals in Hong Kong. A random sample of 1,000 patients was drawn to develop the NLP algorithm. Manual review was the gold standard for assessing the algorithm's performance, with sensitivity, specificity, precision, and F-measure calculated at the record, episode, and patient levels. Additionally, the study compared the number of falls identified by ICD codes and clinical notes independently and combined.

Results:

Our NLP algorithm showed excellent performance, with a sensitivity, specificity, precision, and F-measure of 93.3%, 99.0%, 87.5%, and 0.903 at the record and episode level, and 92.9%, 98.3%, 89.7%, and 0.912 at the patient level. The combined identification strategy using ICD codes and the NLP method provided the most comprehensive capture of fall-related episodes and fallers.

Conclusions:

The rule-based NLP method proved efficient and accurate in detecting falls from clinical notes in inpatient episodes. For comprehensive capture of fall episodes and fallers, we recommend the combined use of ICD codes and the NLP algorithm, which should be applied in future fall epidemiology studies and clinical practice for identifying high-risk groups of fall interventions.

Citation

Please cite as:

Qian XX, Chau PH, Fong DY, Ho M, Woo J

Development and Validation of a Rule-Based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults: Retrospective Analysis

JMIR Aging 2025;8:e65195

DOI: 10.2196/65195

PMID: 40627677

PMCID: 12262146

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Aging

Date Submitted: Aug 7, 2024

Date Accepted: May 10, 2025

Development and Validation of a Rule-based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults

ABSTRACT

Citation

Copyright