Accepted for/Published in: JMIR Research Protocols
Date Submitted: Oct 7, 2025
Date Accepted: Apr 22, 2026
Evaluation of latent health risk prediction models: A protocol for a mixed-methods study using clinical triage as a vehicle for comparison and discussion
ABSTRACT
Background:
Determining clinical urgency and resource allocation within acute patient populations is complex. Tools are being developed to capture global assessments of patient health beyond disease-specific scores, aiming to provide dynamic assessments incorporating both baseline physiological reserve and immediate illness severity
Objective:
This study evaluates two contrasting approaches to latent health measurement: FI-lab, a transparent algorithmic tool using bottom-up aggregation of laboratory abnormalities, and ETHOS-ARES, a transformer-based model using top-down learning from electronic health records.
Methods:
This two-phase mixed-methods study uses clinical triage scenarios with ≥30 clinicians sampled across hospital roles (physicians, surgeons, critical care nurses, advanced practitioners). Phase 1 compares unaided clinician judgments of severity and clinical urgency against model outputs using Spearman's rank correlation (ρ ≥ 0.70 indicates good agreement as primary outcome). A novel "clinical Turing test" assesses whether model rankings are statistically distinguishable from clinician assessments. Phase 2 allows clinicians to incorporate model outputs into identical tasks, measuring anchoring effects through within-person pre/post comparison. Semi-structured interviews explore clinical utility, trust, and perceived limitations. Case materials derive from MIMIC-IV-ED, presented as slide decks containing emergency department documentation, examination findings, laboratory results, and imaging reports. Qualitative analysis follows the Framework Method with dual independent coding.
Results:
Data collection is planned for October-November 2025, with analysis to follow in December 2025.
Conclusions:
We anticipate findings will quantify agreement between model outputs and clinician consensus, measure any anchoring effects from model exposure, and generate insights from qualitative data regarding clinical utility, feasibility, and factors influencing clinician trust and adoption of different approaches to latent health measurement. Clinical Trial: n/a
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.