Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jul 30, 2024
Date Accepted: Jan 18, 2025

The final, peer-reviewed published version of this preprint can be found here:

Public Disclosure of Results From Artificial Intelligence/Machine Learning Research in Health Care: Comprehensive Analysis of ClinicalTrials.gov, PubMed, and Scopus Data (2010-2023)

Maru S, Kuwatsuru R, Matthias MD, Simpson RJ Jr

Public Disclosure of Results From Artificial Intelligence/Machine Learning Research in Health Care: Comprehensive Analysis of ClinicalTrials.gov, PubMed, and Scopus Data (2010-2023)

J Med Internet Res 2025;27:e60148

DOI: 10.2196/60148

PMID: 40117574

PMCID: 11971578

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Public Disclosure of Results from Artificial Intelligence/Machine Learning Research in the Past 14 Years: Cross-sectional Analysis

  • Shoko Maru; 
  • Ryohei Kuwatsuru; 
  • Michael D Matthias; 
  • Ross Joseph Simpson Jr

ABSTRACT

Background:

While systematic reviews can identify reported results of artificial intelligence/machine learning (AI/ML) research, they only represent the numerator of the total research conducted.

Objective:

To estimate the proportions and patterns of result disclosures of AI/ML research on ClinicalTrials.gov and peer-reviewed publications indexed in PubMed or Scopus, and to reveal the extent of undisclosed results years after study completion.

Methods:

Using data from the Clinical Trials Transformation Initiative Aggregate Analysis of ClinicalTrials.gov, we identified studies that started and completed between January 2010 to December 2023 and contained AI/ML-specific terms in study descriptions, official title, brief summary, interventions, conditions, detailed descriptions, or sponsors’ keywords. For 842 completed studies, we searched PubMed and Scopus for publications that included study identifiers and AI/ML-specific terms within the titles or abstracts. We calculated disclosure rates within 3 years of study completion and the median time to disclosure from the ‘primary completion date’ to the ‘results first posted date’ on ClinicalTrial.gov or the earliest date of journal publication.

Results:

Of 842 completed studies (357 interventional; 485 observational), 5.5% (46/842) disclosed results on ClinicalTrials.gov, 13.3% (112/842) in journal publications, and 17.2% (145/842) through either dissemination route within 3 years of completion. The disclosure rates were higher among trials: 10.4% (37/357) on ClinicalTrials.gov, 18.8% (67/357) in journal publications, and 25.8% (92/357) through either route. Rates were also higher among randomized controlled trials: 11.3% (23/203) on ClinicalTrials.gov, 23.6% (48/203) in journal publications, and 31.5% (64/203) through either route. Nevertheless, most study findings (82.8%; 697/842) remained undisclosed after 3 years of completion. Trials employing randomization (vs. nonrandomized) or masking (vs. open-label) had higher disclosure rates and shorter times to disclosure. Although 85% (305/357) of completed trials had sample sizes of ≤1000, larger trials (n>1000) had higher publication rates (28.8%; 15/52) than smaller trials (n≤1000) (17.0%; 52/305). Conversely, large observational studies (n>1000) published less (6.7%; 9/134) than smaller studies (10.3%; 36/351). The most published sponsors were academia (14.7%; 38/259) and hospitals (12.4%; 42/340), followed by industry (11.0%; 16/146). Although high-income countries accounted for 81.9% (86/105) of published studies, the publication rates per completed study did not greatly differ among high-income (14.0%; 86/613), upper-middle (11.5%; 16/139), and lower-middle countries (13.0%; 3/23). Of studies with disclosed results, the median times to report through ClinicalTrials.gov and in journal publications were 505 days (IQR: 399-676) and 411 days (IQR: 274-676), respectively. Single-center studies were more common than multicenter studies for both trials (83.3%; 290/348) and observational studies (82.3%; 377/458).

Conclusions:

Of registered AI/ML studies completed during 2010-2023, over 80% of the findings remained undisclosed three years after study completion. While methodological rigor was generally associated with higher publication rates, a predominance of single-center designs was notable in both interventional and observational studies.


 Citation

Please cite as:

Maru S, Kuwatsuru R, Matthias MD, Simpson RJ Jr

Public Disclosure of Results From Artificial Intelligence/Machine Learning Research in Health Care: Comprehensive Analysis of ClinicalTrials.gov, PubMed, and Scopus Data (2010-2023)

J Med Internet Res 2025;27:e60148

DOI: 10.2196/60148

PMID: 40117574

PMCID: 11971578

Per the author's request the PDF is not available.