JMIR Preprints #69700: Leveraging Artificial Intelligence to Optimize Maintenance of Health Evidence™, a One-stop Shop for Quality-appraised Evidence Syntheses on the Effectiveness of Public Health Interventions: A Quality Improvement Project

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Leveraging Artificial Intelligence to Optimize Maintenance of Health Evidence™, a One-stop Shop for Quality-appraised Evidence Syntheses on the Effectiveness of Public Health Interventions: A Quality Improvement Project

Kristin Rogers;
Alanna Miller;
Ashley Girgis;
Emily C Clark;
Sarah E Neil-Sztramko;
Maureen Dobbins

ABSTRACT

Background:

Health Evidence™ provides access to quality appraisals for >10,000 evidence syntheses on the effectiveness and cost-effectiveness of public health and health promotion interventions. Maintaining Health Evidence™ has become increasingly resource intensive due to the exponential growth of published literature. Innovative screening methods using Artificial intelligence (AI) can potentially improve efficiency.

Objective:

The objectives of this study are to: 1) assess the ability of AI-assisted screening to correctly predict non-relevant references at the title and abstract level and investigate the consistency of this performance over time, and 2) evaluate the impact of AI-assisted screening on the overall monthly manual screening set.

Methods:

Training and testing were conducted using the DistillerSR AI Preview & Rank feature. A set of manually screened references (n=43,273) was uploaded and used to train the AI feature and assign probability scores to each reference to predict relevance. A minimum threshold was established where the AI feature correctly identified all manually screened relevant references. The AI feature was tested on a separate set of references (n=72,686) from the May 2019 to April 2020 monthly searches. The testing set was used to determine an optimal threshold by assessing assigned probability scores until up to five false negative references were identified at the manual full-text screening level. This ensured >99% of relevant references would continue to be added to Health Evidence™. The performance of AI-assisted screening at the title and abstract screening level was evaluated using recall, specificity, precision, negative predictive value, and number of references removed by AI. The number and percentage of references removed by AI-assisted screening and change in monthly manual screening time was estimated using an implementation reference set (n=272,253) from November 2020 to 2023.

Results:

The minimum threshold in the training set of references was 0.068, which correctly removed 37% of non-relevant references. Analysis of the testing set identified an optimal threshold of 0.17, which removed 51,706 references (71.14%) using AI-assisted screening. A slight decrease in recall between the 0.068 minimum threshold (99.68%) and the 0.17 optimal threshold (94.84%) was noted, resulting in four missed references included via manual screening at the full-text level. This was accompanied by an increase in specificity from 35.95% to 71.70%, doubling the proportion of references AI-assisted screening correctly predicted as not relevant. Over three years of implementation, the number of references requiring manual screening was reduced by 70%, reducing the time spent manually screening by an estimated 382 hours.

Conclusions:

Given the magnitude of newly published peer-reviewed evidence, the curation of evidence supports decision-makers in making informed decisions. AI-assisted screening can be an important tool to supplement manual screening and reduce the number of references that require manual screening, ensuring the continued availability of curated high-quality synthesis evidence in public health is possible.

Citation

Please cite as:

Rogers K, Miller A, Girgis A, Clark EC, Neil-Sztramko SE, Dobbins M

Leveraging AI to Optimize Maintenance of Health Evidence and Offer a One-Stop Shop for Quality-Appraised Evidence Syntheses on the Effectiveness of Public Health Interventions: Quality Improvement Project

J Med Internet Res 2025;27:e69700

DOI: 10.2196/69700

PMID: 40729661

PMCID: 12306910

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Dec 6, 2024

Date Accepted: Jun 5, 2025

Leveraging Artificial Intelligence to Optimize Maintenance of Health Evidence™, a One-stop Shop for Quality-appraised Evidence Syntheses on the Effectiveness of Public Health Interventions: A Quality Improvement Project

ABSTRACT

Citation

Copyright