Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Research Protocols

Date Submitted: Aug 23, 2023
Date Accepted: Apr 18, 2024

The final, peer-reviewed published version of this preprint can be found here:

Detecting Algorithmic Errors and Patient Harms for AI-Enabled Medical Devices in Randomized Controlled Trials: Protocol for a Systematic Review

Kale AU, Hogg HDJ, Pearson R, Glocker B, Golder S, Coombe A, Waring J, Liu X, Moore DJ, Denniston AK

Detecting Algorithmic Errors and Patient Harms for AI-Enabled Medical Devices in Randomized Controlled Trials: Protocol for a Systematic Review

JMIR Res Protoc 2024;13:e51614

DOI: 10.2196/51614

PMID: 38941147

PMCID: 11245650

Detecting Algorithmic Errors and Patient Harms for Artificial Intelligence (AI) enabled Medical Devices in Randomised Controlled Trials: A systematic review protocol

  • Aditya U Kale; 
  • Henry David Jeffry Hogg; 
  • Russel Pearson; 
  • Ben Glocker; 
  • Su Golder; 
  • April Coombe; 
  • Justin Waring; 
  • Xiaoxuan Liu; 
  • David J Moore; 
  • Alastair K Denniston

ABSTRACT

Background:

AI health technologies have the potential to transform existing clinical workflows and ultimately improve patient outcomes. AI health technologies have shown potential for a range of clinical tasks such as diagnostics, prognostics, and therapeutic decision making such as drug dosing. There is however an urgent need to ensure that AI health technologies remain safe for all populations. Recent literature demonstrates the need for rigorous performance error analysis to identify issues such as algorithmic encoding of spurious correlations (e.g. protected characteristics), or specific failure modes that may lead to patient harm. Guidelines for reporting of studies evaluating AI health technologies (e.g. CONSORT-AI) require mention of performance error analysis, however there is still a lack of understanding around how performance errors should be analysed in clinical studies, and what harms authors should aim to detect and report.

Objective:

This systematic review will assess the frequency, severity of AI errors and patient harms in randomised controlled trials (RCTs) investigating AI interventions in clinical settings. The review will also explore how performance errors are analysed including whether analysis includes investigation of subgroup level outcomes.

Methods:

This systematic review will identify and select randomised controlled trials assessing AI interventions. Search strategies will be deployed in MEDLINE, EMBASE, Cochrane CENTRAL and clinical trials registries to identify relevant articles. RCTs identified in bibliographic databases will be cross-referenced with clinical trials registries. The primary outcomes of interest are the frequency and severity of AI errors, patient harms and reported adverse events. Quality assessment of RCTs will be based on RoB2. Data analysis will include comparison of error rates and patient harms between study arms and a meta-analysis of the rates of patient harm in control versus intervention arms will be conducted if appropriate.

Results:

The project was registered on PROSPERO in February 2023. Preliminary searches have been completed and the search strategy has been designed in consultation with an information specialist (see appendices 1 and 2). Abstract screening will start in August 2024.

Conclusions:

Evaluations of AI health technology have shown promising results, however reporting of studies has been variable. Detection, analysis and reporting of performance errors and patient harms is vital to robustly assess the safety of AI interventions in RCTs. Scoping searches have illustrated that reporting of harms is variable, often with no mention of adverse events. The findings of this systematic review will identify the frequency and severity of AI performance errors and patient harms, and generate insights into how errors should be analysed to account for both overall and subgroup performance. Systematic review registration PROSPERO CRD42023387747.


 Citation

Please cite as:

Kale AU, Hogg HDJ, Pearson R, Glocker B, Golder S, Coombe A, Waring J, Liu X, Moore DJ, Denniston AK

Detecting Algorithmic Errors and Patient Harms for AI-Enabled Medical Devices in Randomized Controlled Trials: Protocol for a Systematic Review

JMIR Res Protoc 2024;13:e51614

DOI: 10.2196/51614

PMID: 38941147

PMCID: 11245650

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.