Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Oct 3, 2025
Date Accepted: May 17, 2026

The final, peer-reviewed published version of this preprint can be found here:

Predicting Laboratory Test Ordering in Emergency Departments Using Integrated Structured and Unstructured Electronic Health Records: Machine Learning Study

Zhang X, Ling H, Zhang X, Zhang A

Predicting Laboratory Test Ordering in Emergency Departments Using Integrated Structured and Unstructured Electronic Health Records: Machine Learning Study

JMIR Med Inform 2026;14:e85255

DOI: 10.2196/85255

PMID: 42296534

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Predicting Laboratory Test Utilization in Emergency Departments Using Machine Learning Models Integrating Structured and Unstructured Electronic Health Records

  • Xingyu Zhang; 
  • Haipeng Ling; 
  • Xin Zhang; 
  • Anao Zhang

ABSTRACT

Laboratory testing is essential in emergency departments (EDs), but overuse contributes to unnecessary costs. This study explored machine learning models to predict lab test utilization during ED visits using both structured and unstructured electronic health record (EHR) data. We analyzed 15,115 adult visits from the 2021 National Hospital Ambulatory Medical Care Survey–ED. Structured variables included demographics, vital signs, insurance, and medical history; unstructured text from chief complaints and injury descriptions was processed using BERT embeddings. Four models were trained across structured-only, unstructured-only, combined, and ensemble configurations. The combined model achieved the highest AUC (0.83), outperforming structured-only (0.78) and unstructured-only (0.74) models. Key predictors of testing included older age, ambulance arrival, abnormal vitals, and chronic conditions, while injury-related visits predicted lower use. Integrating structured and unstructured EHR data improves lab test prediction and supports the development of decision support tools to promote more efficient diagnostic practices in EDs.


 Citation

Please cite as:

Zhang X, Ling H, Zhang X, Zhang A

Predicting Laboratory Test Ordering in Emergency Departments Using Integrated Structured and Unstructured Electronic Health Records: Machine Learning Study

JMIR Med Inform 2026;14:e85255

DOI: 10.2196/85255

PMID: 42296534

PDF not available

Per the author's request the PDF is not available.