Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Currently submitted to: JMIR AI

Date Submitted: Feb 24, 2026
Open Peer Review Period: Mar 3, 2026 - Apr 28, 2026
(currently open for review)

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

From Skill Assessment to Surgical Teammates: A Scoping Review of Machine Learning for Human–Autonomy Teaming

  • Kamal Barati; 
  • Michael S. Ramirez Campos; 
  • Thomas E. Doyle

ABSTRACT

Background:

Human–autonomy teaming (HAT) has the potential to reshape surgical practice by fostering true partnership between surgeons and intelligent systems. To achieve this, AI must move beyond static scoring to provide adaptive, real-time guidance based on reliable skill assessment.

Objective:

This scoping review aims to map the current landscape of machine learning (ML) methods for surgical skill assessment and evaluate their technical readiness for integration into functional HAT systems.

Methods:

Following PRISMA-ScR guidelines, we conducted a systematic search across three major scientific databases (PubMed, IEEE Xplore, and Scopus). We identified and analyzed 92 peer-reviewed studies published between 2019 and 2025. The review focused on data modalities (kinematics, video, biosignals), model architectures, and validation environments.

Results:

Our analysis of the 92 included studies reveals a dominant shift toward multimodal data integration and deep learning architectures. While high performance is frequently reported on benchmark datasets, significant barriers to HAT integration persist. We identified that a majority of current models lack interpretability and fail to demonstrate generalizability to real-world clinical settings. Furthermore, validation practices remain inconsistent, with limited evidence of adaptivity to individual user needs during live surgical workflows.

Conclusions:

Current AI techniques provide a robust foundation for objective skill assessment, but they are not yet ready for autonomous teaming. Future development must prioritize model robustness, interpretability, and seamless integration into clinical environments to transition from standalone assessment tools to effective surgical teammates. Clinical Trial: The protocol was registered at OSF Registries [https://doi.org/10.17605/OSF.IO/PQWS5]


 Citation

Please cite as:

Barati K, Ramirez Campos MS, Doyle TE

From Skill Assessment to Surgical Teammates: A Scoping Review of Machine Learning for Human–Autonomy Teaming

JMIR Preprints. 24/02/2026:94109

DOI: 10.2196/preprints.94109

URL: https://preprints.jmir.org/preprint/94109

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.