Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Jul 23, 2024
Date Accepted: Jan 28, 2025

The final, peer-reviewed published version of this preprint can be found here:

GPT-3.5 Turbo and GPT-4 Turbo in Title and Abstract Screening for Systematic Reviews

Oami T, Okada Y, Nakada Ta

GPT-3.5 Turbo and GPT-4 Turbo in Title and Abstract Screening for Systematic Reviews

JMIR Med Inform 2025;13:e64682

DOI: 10.2196/64682

PMID: 40073422

PMCID: 11922487

GPT-3.5 Turbo and GPT-4 Turbo in Title and Abstract Screening for Systematic Reviews

  • Takehiko Oami; 
  • Yohei Okada; 
  • Taka-aki Nakada

ABSTRACT

This study compared the accuracy and efficiency of GPT-3.5 Turbo and GPT-4 Turbo in citation screening for systematic reviews in critical care. We used the data from the Japanese Clinical Practice Guidelines for Management of Sepsis and Septic Shock 2024. GPT-4 Turbo demonstrated superior specificity (0.98) compared to GPT-3.5 Turbo (0.51), with comparable sensitivity (0.85 vs. 0.83). GPT-3.5 Turbo processed 100 studies slightly faster than GPT-4 Turbo (0.9 vs. 1.6 minutes). GPT-4 Turbo may be more suitable in screening citations due to its higher specificity. This study highlights the potential of large language models in optimizing literature selection processes.


 Citation

Please cite as:

Oami T, Okada Y, Nakada Ta

GPT-3.5 Turbo and GPT-4 Turbo in Title and Abstract Screening for Systematic Reviews

JMIR Med Inform 2025;13:e64682

DOI: 10.2196/64682

PMID: 40073422

PMCID: 11922487

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.