Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Aging

Date Submitted: Aug 7, 2024
Date Accepted: May 10, 2025

The final, peer-reviewed published version of this preprint can be found here:

Development and Validation of a Rule-Based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults: Retrospective Analysis

Qian XX, Chau PH, Fong DY, Ho M, Woo J

Development and Validation of a Rule-Based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults: Retrospective Analysis

JMIR Aging 2025;8:e65195

DOI: 10.2196/65195

PMID: 40627677

PMCID: 12262146

Development and Validation of a Rule-based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults

  • Xing Xing Qian; 
  • Pui Hing Chau; 
  • Daniel YT Fong; 
  • Mandy Ho; 
  • Jean Woo

ABSTRACT

Background:

To address fall underestimation by the International Classification of Diseases (ICD) in clinical settings, incorporating information from clinical notes via Natural Language Processing (NLP) has emerged as a solution. However, its application to inpatient notes has not been fully investigated.

Objective:

This study developed and validated a rule-based NLP algorithm to identify falls based on inpatient admission notes from older patients.

Methods:

This retrospective study used 12-year electronic inpatient records of patients aged ≥65 from public hospitals in Hong Kong. A random sample of 1,000 patients was drawn to develop the NLP algorithm. Manual review was the gold standard for assessing the algorithm's performance, with sensitivity, specificity, precision, and F-measure calculated at the record, episode, and patient levels. Additionally, the study compared the number of falls identified by ICD codes and clinical notes independently and combined.

Results:

Our NLP algorithm showed excellent performance, with a sensitivity, specificity, precision, and F-measure of 93.3%, 99.0%, 87.5%, and 0.903 at the record and episode level, and 92.9%, 98.3%, 89.7%, and 0.912 at the patient level. The combined identification strategy using ICD codes and the NLP method provided the most comprehensive capture of fall-related episodes and fallers.

Conclusions:

The rule-based NLP method proved efficient and accurate in detecting falls from clinical notes in inpatient episodes. For comprehensive capture of fall episodes and fallers, we recommend the combined use of ICD codes and the NLP algorithm, which should be applied in future fall epidemiology studies and clinical practice for identifying high-risk groups of fall interventions.


 Citation

Please cite as:

Qian XX, Chau PH, Fong DY, Ho M, Woo J

Development and Validation of a Rule-Based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults: Retrospective Analysis

JMIR Aging 2025;8:e65195

DOI: 10.2196/65195

PMID: 40627677

PMCID: 12262146

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.