Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Formative Research

Date Submitted: May 7, 2025
Date Accepted: Nov 17, 2025

The final, peer-reviewed published version of this preprint can be found here:

Integrating GPT-4o Into Data Mining in Neurosurgery: Feasibility and Proof-of-Concept Study

Almeida Sales AH, Beck J, Grauvogel J

Integrating GPT-4o Into Data Mining in Neurosurgery: Feasibility and Proof-of-Concept Study

JMIR Form Res 2026;10:e77114

DOI: 10.2196/77114

Integrating GPT-4o for Data Mining in Neurosurgery: A Feasibility and Proof-of-Concept Study

  • Arthur Henrique Almeida Sales; 
  • Jürgen Beck; 
  • Jürgen Grauvogel

ABSTRACT

Background:

Artificial Intelligence (AI), particularly large language models (LLMs) like GPT-4, is increasingly being integrated into healthcare to enhance data mining capabilities, especially for unstructured medical records. The application of AI in neurosurgery has the potential to streamline data extraction, reduce operational inefficiencies, and improve clinical outcomes, but its effectiveness in extracting structured data from unstructured medical reports remains unclear.

Objective:

The aim of this proof-of-concept study was to evaluate the feasibility and accuracy of using GPT-4o for extracting structured data from unstructured neurosurgical reports, specifically focusing on patient data related to vestibular schwannomas (VS).

Methods:

This retrospective study included ten consecutive patients diagnosed with VS who underwent surgical treatment between August and December 2023. Raw medical reports, including discharge reports, surgery reports, histopathology reports, and 3-month follow-up reports, were provided to GPT-4o for data extraction. The AI’s performance in extracting structured data was evaluated by comparing its outputs with human expert evaluations. Data accuracy was assessed for both non-interpreted structured information (e.g., patient ID, tumor grade) and interpretive data (e.g., symptoms, complications, postoperative deficits).

Results:

GPT-4o demonstrated high accuracy in extracting straightforward data categories such as patient ID, date of surgery, histopathological diagnosis, and WHO tumor grade, with a 100% accuracy rate. However, extraction of variables like intraoperative complications and postoperative deficits initially showed lower accuracy (50%), which improved significantly (90-100%) after model adjustment using refined prompts.

Conclusions:

GPT-4o successfully extracted structured data from unstructured neurosurgical reports, achieving high accuracy for most data categories. This study highlights the potential of AI in enhancing clinical workflows, particularly in data mining and automated medical coding, which can reduce manual workloads and improve research and clinical operations. Further studies are required to refine these techniques and address regulatory and ethical concerns before widespread implementation.


 Citation

Please cite as:

Almeida Sales AH, Beck J, Grauvogel J

Integrating GPT-4o Into Data Mining in Neurosurgery: Feasibility and Proof-of-Concept Study

JMIR Form Res 2026;10:e77114

DOI: 10.2196/77114

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.