Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Currently submitted to: Journal of Medical Internet Research

Date Submitted: Feb 20, 2026
Open Peer Review Period: Feb 21, 2026 - Apr 18, 2026
(currently open for review)

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Surgeons’ Perceptions of Machine-learning‑based Prognostic Tools for Spine Surgery: A Qualitative Study

  • Zheng An Toh; 
  • Per Koren Solvang; 
  • Hwee Weng Dennis Hey; 
  • Wei How Darryl ANG; 
  • Yan Pang; 
  • Kanokwan Hounsri; 
  • Tore Solberg; 
  • Bjørnar Berg; 
  • Margreth Grotle; 
  • Minna Pikkarainen; 
  • Hong-Gu He

ABSTRACT

Background:

Machine-learning-enabled prognostic models are increasingly proposed to support surgical decision-making for degenerative lumbar disorders, yet their clinical adoption remains limited. Understanding how surgeons perceive these tools is critical for effective implementation, particularly given the high-stakes nature of spine surgery where decisions carry long-term functional consequences and medico-legal implications.

Objective:

This study aimed to explore how consultant-level spine surgeons perceive machine-learning-based prognostic tools, including their trustworthiness, clinical utility, usability, workflow integration, and implications for patient counselling and shared decision-making.

Methods:

A qualitative study using semi-structured one-to-one interviews was conducted with 11 consultant-level orthopaedic and neurosurgeons practising in Singapore (response rate: 73% of 15 invited). Participants had a mean of 10.9 years (range: 8-25 years) of spine surgery experience; 82% (n=9) were male. Interviews (range: 15-57 minutes) were transcribed verbatim and analysed using Braun and Clarke's six-step reflexive thematic analysis. Data collection continued until information power was achieved, with three additional interviews completed after thematic sufficiency was reached as participants had already consented. The study was designed and reported in accordance with COREQ criteria.

Results:

Three overarching themes were developed. Trust contingent on data integrity revealed that surgeons' confidence depended fundamentally on data quality, local representativeness, labelling credibility, and rigorous validation, with participants consistently emphasising population representativeness as a trust prerequisite. Surgeons operationalised trust through accessible performance metrics, with most identifying AUROC as their preferred credibility heuristic. Pragmatic orientation as important for implementation demonstrated that usability and seamless electronic health record integration were non-negotiable prerequisites, with surgeons explicitly stating that manual data entry would preclude adoption. Medico-legal concerns were prominent, with participants emphasising that decision responsibility remains with the clinician. Surgical decision-making as a delicate dance between art and science reflected how prognostic outputs were positioned as adjunctive inputs to be reconciled with experiential judgement and patient heterogeneity. Surgeons emphasised that identical prognostic information would be interpreted differently based on practice philosophy, and most highlighted the inherent divergence between technical success and patient-perceived success, underscoring prognostic tools' value for expectation management rather than deterministic prediction.

Conclusions:

This first qualitative study of spine surgeons' perceptions reveals that adoption of machine-learning-based prognostic tools is contingent on data integrity, pragmatic workflow integration, and alignment with professional judgement and not predictive performance alone. Surgeons expressed cautious openness, viewing these tools as valuable in complex cases for clarifying outcome expectations without displacing clinical responsibility. Meaningful implementation requires robust data governance, contextually grounded validation, seamless electronic integration, and explicit positioning of machine learning as supportive rather than substitutive of surgical judgement. These findings provide empirically grounded guidance for developing clinically acceptable and implementation-ready prognostic decision support systems.


 Citation

Please cite as:

Toh ZA, Solvang PK, Hey HWD, ANG WHD, Pang Y, Hounsri K, Solberg T, Berg B, Grotle M, Pikkarainen M, He HG

Surgeons’ Perceptions of Machine-learning‑based Prognostic Tools for Spine Surgery: A Qualitative Study

JMIR Preprints. 20/02/2026:93810

DOI: 10.2196/preprints.93810

URL: https://preprints.jmir.org/preprint/93810

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.