Accepted for/Published in: JMIR Formative Research
Date Submitted: Mar 30, 2025
Open Peer Review Period: Apr 7, 2025 - May 5, 2025
Date Accepted: May 21, 2025
Date Submitted to PubMed: May 23, 2025
(closed for review but you can still tweet)
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
The Umbrella Collaboration®: An Innovative Tertiary Evidence Synthesis Methodology. Validation Study
ABSTRACT
Background:
The synthesis of evidence in healthcare is essential for informed decision-making and policy development. This study aims to validate The Umbrella Collaboration® (TU®), an innovative, semi-automatic tertiary evidence synthesis methodology, by comparing it with Traditional Umbrella Reviews (TUR), which are currently the gold standard.
Objective:
The primary objective of this study is to evaluate whether TU®, an AI-assisted, software-driven system for tertiary evidence synthesis, can achieve comparable effectiveness to TURs, while offering a more timely, efficient, and comprehensive approach.
Methods:
This comparative study evaluated TU® against TURs across eight matched projects in geriatrics. For each selected TUR, a parallel TU® project was conducted using the same research question. Outcomes of interest (OoIs), effect sizes, certainty ratings, and execution times were systematically compared. Effect sizes were assessed both quantitatively, by transforming TUR metrics to Cohen’s d and correlating them with TU®’s RTU metric, and qualitatively, through categorical classifications (trivial, small, moderate, large). Certainty levels were compared by mapping GRADE ratings and TU®’s sentiment analysis scores onto a common 0–1 scale. Execution time was measured precisely in TU®, while TUR durations were estimated from literature benchmarks. Statistical analyses included chi-squared tests and Spearman correlations.
Results:
Eight TURs in geriatrics were matched with parallel projects using TU®. TU® replicated 84.9% (73/86) of the OoIs identified by TURs and reported an additional 337 OoIs, representing a 4.77-fold increase in outcome identification. In the comparison of effect size classifications, full concordance was observed in 50.0% of cases and consistent concordance (full plus one-level deviation) in 93.8%, with a moderate strength of association (Cramér’s V = 0.339). The correlation of transformed certainty values between TU® and GRADE yielded a statistically significant Spearman coefficient (ρ = 0.446; P = .025). The average execution time per TU® project was 4 hours and 46 minutes, compared to estimated durations of 6–12 months for TURs.
Conclusions:
The Umbrella Collaboration® demonstrated high concordance with TURs, replicating 84.9% of the outcomes identified by TURs and identifying nearly five times as many additional outcomes. The experimental effect size metric (RTU) showed moderate agreement with conventional measures, and the certainty ratings derived from sentiment analysis correlated acceptably with GRADE-based assessments. While further validation is needed, TU® appears to be a valid and efficient approach for tertiary evidence synthesis, offering a scalable and time-efficient alternative when rapid results are required.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.