JMIR Preprints #18298: Automated Identification of Disease-Specific Clinical Outcomes Using ClinicalTrials.gov

Automated Identification of Disease-Specific Clinical Outcomes Using ClinicalTrials.gov

Anas Elghafari;
Joseph Finkelstein

ABSTRACT

Background:

Common clinical outcomes are vital for ensuring comparability of clinical trial data and enabling meta-analyses and inter-study comparisons. Traditionally, deciding which outcomes to recommend as common for a particular disease has relied on assembling and surveying panels of subject-matter experts, which is usually a time-consuming and laborious process.

Objective:

This work aims to develop and evaluate a generalized pipeline that automatically identifies common outcomes specific to any given disease by finding, downloading, and analyzing data from previous clinical trials relevant to that disease.

Methods:

An automated pipeline was designed to interface with the ClinicalTrials.gov API and download the trials relevant to the input condition. The primary and secondary outcomes of those trials were parsed, grouped by text similarity, and ranked by frequency. The quality of the pipeline’s output was assessed by comparing the top outcomes it identified for chronic obstructive pulmonary disease (COPD) with a list of 79 outcomes manually abstracted from 3 frequently cited expert reviews delineating clinical outcomes for COPD.
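The grouping and ranking step described above can be illustrated with a minimal sketch. The abstract does not state which text-similarity measure or threshold the authors used, so the choice of `difflib.SequenceMatcher`, the `0.8` threshold, the greedy first-match grouping, and the function names `normalize`, `group_outcomes`, and `rank_groups` are all assumptions made for illustration. The sample outcome strings are invented and are not taken from the study data.

```python
from difflib import SequenceMatcher


def normalize(text):
    """Lowercase and collapse whitespace so trivial differences do not split groups."""
    return " ".join(text.lower().split())


def group_outcomes(outcomes, threshold=0.8):
    """Greedily place each outcome in the first group whose label is similar enough.

    The similarity measure and threshold are illustrative assumptions,
    not the method reported by the authors.
    """
    groups = []  # each group: {"label": normalized text, "members": raw strings}
    for raw in outcomes:
        text = normalize(raw)
        for group in groups:
            if SequenceMatcher(None, text, group["label"]).ratio() >= threshold:
                group["members"].append(raw)
                break
        else:
            # No existing group was close enough, so start a new one.
            groups.append({"label": text, "members": [raw]})
    return groups


def rank_groups(groups):
    """Order groups from most to least frequent."""
    return sorted(groups, key=lambda g: len(g["members"]), reverse=True)


# Invented sample outcome titles, for illustration only.
sample = [
    "Change in FEV1",
    "change in  FEV1",
    "Change in FEV1.",
    "St. George's Respiratory Questionnaire score",
    "Number of exacerbations",
    "Number of exacerbations.",
]
ranked = rank_groups(group_outcomes(sample))
for group in ranked:
    print(len(group["members"]), group["label"])
```

A greedy single pass keeps the sketch short, but its result depends on input order; a production pipeline would more likely use a proper clustering method or a domain-specific normalization of outcome names.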

Results:

The pipeline successfully downloaded and processed 3,876 studies related to COPD. Manual verification indicated that the pipeline downloaded and processed the same number of trials as was obtained from the self-service ClinicalTrials.gov portal. Evaluating the automatically identified outcomes against the manually abstracted ones showed that the pipeline achieved a recall of 91% and a precision of 77%. An assessment of the most frequent pipeline outcomes that were not included in the reviews indicated that they were relevant to COPD and could be considered in future research.
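For readers less familiar with the two metrics, the sketch below shows the standard set-based definitions: recall is the share of reference (review) outcomes that the pipeline also found, and precision is the share of pipeline outcomes that appear in the reference. How the authors matched pipeline outcomes to review outcomes is not described in the abstract, so exact string matching on already-harmonized labels is an assumption here, as are the function name `precision_recall` and the toy outcome sets, which do not reproduce the study's figures.

```python
def precision_recall(predicted, reference):
    """Return (precision, recall) for two collections of outcome labels.

    Assumes the labels have already been harmonized so that equal strings
    denote the same outcome; that matching step is not shown here.
    """
    predicted, reference = set(predicted), set(reference)
    true_positives = predicted & reference
    precision = len(true_positives) / len(predicted) if predicted else 0.0
    recall = len(true_positives) / len(reference) if reference else 0.0
    return precision, recall


# Toy data for illustration only; not the outcomes evaluated in the study.
pipeline_outcomes = {"fev1", "exacerbations", "sgrq score", "six-minute walk distance"}
review_outcomes = {"fev1", "exacerbations", "sgrq score", "mortality", "dyspnea"}

precision, recall = precision_recall(pipeline_outcomes, review_outcomes)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

In this toy case three of the four pipeline outcomes appear in the reference list and three of the five reference outcomes are recovered.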

Conclusions:

An automated, evidence-based pipeline can identify clinical outcomes comparable in breadth and quality to those identified by the expert reviews. Moreover, such an approach can highlight relevant outcomes for further consideration.

Citation

Please cite as:

Elghafari A, Finkelstein J

Automated Identification of Common Disease-Specific Outcomes for Comparative Effectiveness Research Using ClinicalTrials.gov: Algorithm Development and Validation Study

JMIR Med Inform 2021;9(2):e18298

DOI: 10.2196/18298

PMID: 33460388

PMCID: 7899806

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Feb 17, 2020

Date Accepted: Jan 17, 2021

Date Submitted to PubMed: Jan 18, 2021
