Targeted Development and Validation of Clinical Prediction Models in Secondary Care Settings: Opportunities and Challenges for Electronic Health Record Data

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

The dawn of targeted development and validation of clinical prediction models in secondary care settings – opportunities and challenges for electronic health records data

Ingrid Susanne van Maurik;
H.J. Doodeman;
B.W. Veeger-Nuijens;
D.R. Sudiono;
W. Jongbloed;
E. van Soelen

ABSTRACT

Upon deploying a clinical prediction model (CPM) in clinical practice, the performance and generalizability of a CPM needs to be demonstrated in the population of intended use. This is also called ‘targeted validation’. In this viewpoint we consider targeted validation and the use of clinical prediction models (CPM) in a secondary care setting using Electronic Health Record (EHR) data. More than half of the patients with (complex) care needs are treated in leading clinical teaching hospitals in secondary care settings. The growing influx of patients combined with the worldwide problem of health workforce shortage requires efficient and effective organization of care. Moreover, the healthcare field increasingly moves towards health(care) decisions to be made by both the clinician and the patient (i.e. shared decision making). Against that background, CPMs may have particular use in a secondary care setting. Yet, CPMs are not always validated and if validation occurred, it is often not ‘targeted’ and/or focusses solely on discrimination of a CPM. The accuracy of predicted risks (calibration) is far less often assessed, while this is pivotal for transition of a CPMs into clinical practice. The introduction of artificial intelligence based software applications in secondary care settings enables the creation of statistically powerful datasets from unstructured EHRs. This comes with both technical and legal challenges. Upon using EHR data for the development and validation of CPMs, alongside the widely accepted checklists, we propose to additionally consider the three practical steps: 1) let a local EHR expert (clinician, nurse) be involved in the data extraction process, 2) perform validity checks on the generated datasets and 3) provide metadata on how variables were constructed from EHRs. If successful, such datasets are statistically powerful and opens the gates for targeted development and validation of CPMs in secondary care settings, filling a major gap in prediction modelling research and appropriately advancing CPM’s into clinical practice.

Citation

Please cite as:

van Maurik IS, Doodeman H, Veeger-Nuijens B, Sudiono D, Jongbloed W, van Soelen E

Targeted Development and Validation of Clinical Prediction Models in Secondary Care Settings: Opportunities and Challenges for Electronic Health Record Data

JMIR Med Inform 2024;12:e57035

DOI: 10.2196/57035

PMID: 39447145

PMCID: 11615704