Currently submitted to: JMIR AI
Date Submitted: Nov 13, 2025
Open Peer Review Period: Dec 2, 2025 - Jan 27, 2026
(closed for review but you can still tweet)
NOTE: This is an unreviewed Preprint
Warning: This is a unreviewed preprint (What is a preprint?). Readers are warned that the document has not been peer-reviewed by expert/patient reviewers or an academic editor, may contain misleading claims, and is likely to undergo changes before final publication, if accepted, or may have been rejected/withdrawn (a note "no longer under consideration" will appear above).
Peer review me: Readers with interest and expertise are encouraged to sign up as peer-reviewer, if the paper is within an open peer-review period (in this case, a "Peer Review Me" button to sign up as reviewer is displayed above). All preprints currently open for review are listed here. Outside of the formal open peer-review period we encourage you to tweet about the preprint.
Citation: Please cite this preprint only for review purposes or for grant applications and CVs (if you are the author).
Final version: If our system detects a final peer-reviewed "version of record" (VoR) published in any journal, a link to that VoR will appear below. Readers are then encourage to cite the VoR instead of this preprint.
Settings: If you are the author, you can login and change the preprint display settings, but the preprint URL/DOI is supposed to be stable and citable, so it should not be removed once posted.
Submit: To post your own preprint, simply submit to any JMIR journal, and choose the appropriate settings to expose your submitted version as preprint.
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Subject-Aware Model Validation for Repeated-Measures Data: A Nested Approach for Trustworthy Medical AI Applications
ABSTRACT
Background:
Repeated-measures datasets are common in biomechanics and digital health, where each participant contributes multiple correlated trials. If cross-validation (CV) ignores this structure, information can leak from training to test folds, inflating performance and undermining clinical credibility.
Objective:
To evaluate the impact of subject-aware validation strategies on model reliability in repeated-measures classification tasks, using fear of re-injury prediction post–anterior cruciate ligament reconstruction (ACLR) as a case study.
Methods:
We analyzed 623 hop trials from 72 individuals post-ACLR to classify fear of re-injury based on biomechanical features. Four cross-validation (CV) strategies were compared: stratified 10-fold CV, Leave-One-Participant-Out CV (LOPOCV), Group 3-Fold CV, and a nested framework combining LOPOCV (outer loop) with Group 3-fold CV (inner loop). Ten supervised classifiers were benchmarked across classification accuracy, train–test generalization gap, model ranking consistency, and computational efficiency.
Results:
Stratified 10-Fold CV systematically overestimated model performance (e.g., Extra Trees accuracy of 0.91 vs. 0.66 under LOPOCV) due to subject-level data leakage. Group and nested CV strategies yielded more conservative and stable estimates. The nested LOPOCV + Group CV framework achieved a good balance between generalization and participant-level independence, with reduced bias and overfitting compared to non-nested alternatives.
Conclusions:
Subject-aware validation strategies are essential for trustworthy ML evaluation in repeated-measures settings. Nested CV designs improve reproducibility, reduce selection bias, and align with regulatory expectations for clinical ML tools. These findings support best practices in model validation for biomechanics and digital health applications.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.