JMIR Preprints #29398: Auto Machine Learning Approaches to Refine the Identification of High-Quality Clinical Research Articles from The Biomedical Literature: A Study Protocol

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Auto Machine Learning Approaches to Refine the Identification of High-Quality Clinical Research Articles from The Biomedical Literature: A Study Protocol

Wael Abdelkader;
Tamara Navarro;
Rick Parrish;
Chris Cotoi;
Federico Germini;
Lori-Ann Linkins;
Alfonso Iorio;
Brian Haynes;
Cynthia Lokker

ABSTRACT

Due to the continued rapid growth in published biomedical literature, it is increasingly difficult to identify and retrieve high-quality evidence. Machine learning approaches have been applied to address this issue. Some models developed using supervised machine learning approaches have achieved high sensitivity or recall, however precision has been variable. In a series of experiments, we will assess the performance of machine learning models to retrieve high-quality, high relevance evidence for clinical consideration from the biomedical literature. The models will be trained using an automated approach applied to a database of almost 100, 000 articles that have been tagged by highly trained research staff based on criteria for high-quality and assessed for clinical relevance by clinicians. We will evaluate and report on the effects of various classifiers, preprocessing steps, feature selection, and the use of balanced vs unbalanced datasets applied during model development on the performance of the derived supervised machine learning models. The series was devised to improve the precision of the retrieval of high-quality articles by applying a machine learning classifier sequentially after using high sensitivity Boolean search filters to an ongoing literature surveillance process. Our multi-level analysis of the various steps of machine learning model development will help expand the existing knowledge base on the effect of each step on the performance of machine learning models.

Citation

Please cite as:

Abdelkader W, Navarro T, Parrish R, Cotoi C, Germini F, Linkins LA, Iorio A, Haynes B, Lokker C

A Deep Learning Approach to Refine the Identification of High-Quality Clinical Research Articles From the Biomedical Literature: Protocol for Algorithm Development and Validation

JMIR Res Protoc 2021;10(11):e29398

DOI: 10.2196/29398

PMID: 34847061

PMCID: 8669577

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Research Protocols

Date Submitted: Apr 7, 2021

Open Peer Review Period: Apr 7, 2021 - Jun 2, 2021

Date Accepted: Sep 17, 2021

Date Submitted to PubMed: Nov 30, 2021

(closed for review but you can still tweet)

Auto Machine Learning Approaches to Refine the Identification of High-Quality Clinical Research Articles from The Biomedical Literature: A Study Protocol

ABSTRACT

Citation

Copyright