Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Mar 29, 2024
Date Accepted: Oct 23, 2025

The final, peer-reviewed published version of this preprint can be found here:

Transforming Surgical Training With AI Techniques for Training, Assessment, and Evaluation: Scoping Review

Escobar-Castillejos D, Barrera-Animas AY, Noguez J, Magana AJ, Benes B

Transforming Surgical Training With AI Techniques for Training, Assessment, and Evaluation: Scoping Review

J Med Internet Res 2025;27:e58966

DOI: 10.2196/58966

PMID: 41252719

PMCID: 12673302

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Transforming Surgical Training: A Systematic Review of AI Techniques for Assessment and Evaluation

  • David Escobar-Castillejos; 
  • Ari Y. Barrera-Animas; 
  • Julieta Noguez; 
  • Alejandra J. Magana; 
  • Bedrich Benes

ABSTRACT

Background:

Artificial intelligence (AI) has introduced novel opportunities for assessment and evaluation in surgical training, offering potential improvements that could surpass traditional educational methods.

Objective:

This systematic review explores the integration of AI in surgical training, assessment, and evaluation with the objective of determining how AI technologies can enhance trainees’ learning paths and performance by incorporating data-driven insights and predictive analytics. Moreover, this study review aims to examine the current state and applications of AI algorithms in this field, identifying potential areas for future research.

Methods:

Adhering to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines and software engineering review protocols, this systematic review searched PubMed, Scopus, and Web of Science databases for relevant studies published from 2020 to 2024. The review is structured around four research questions: the specific surgical procedures to which AI is applied, the AI techniques used, how AI assesses and improves surgical training, and its effects on trainees' learning curves.

Results:

From an initial 1,400 records, 56 studies met the inclusion criteria and were analyzed. AI applications have been used extensively in the fields of minimally invasive surgery (MIS), neurosurgery, and laparoscopy, with machine learning (ML) and deep learning (DL) being the most common techniques used in the field. Moreover, simulation training was identified as the predominant setup. Several studies also reported significant improvements in trainees' learning curves when AI was incorporated into training protocols, although some authors pointed out the importance of further studies in this area.

Conclusions:

The use of AI in surgical training has exhibited the capacity to augment the process of learning and acquiring skills, hence facilitating the transition towards educational models that are more efficient, personalized, and adaptive. Future studies should aim to clarify AI methodologies and focus on scalable solutions that align with universal education goals. Moreover, simulation training emerges as a critical area for ongoing research and development, particularly including ML and DL techniques to design more insightful experiences.


 Citation

Please cite as:

Escobar-Castillejos D, Barrera-Animas AY, Noguez J, Magana AJ, Benes B

Transforming Surgical Training With AI Techniques for Training, Assessment, and Evaluation: Scoping Review

J Med Internet Res 2025;27:e58966

DOI: 10.2196/58966

PMID: 41252719

PMCID: 12673302

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.