Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Research Protocols

Date Submitted: Apr 7, 2021
Open Peer Review Period: Apr 7, 2021 - Jun 2, 2021
Date Accepted: Sep 17, 2021
Date Submitted to PubMed: Nov 30, 2021
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

A Deep Learning Approach to Refine the Identification of High-Quality Clinical Research Articles From the Biomedical Literature: Protocol for Algorithm Development and Validation

Abdelkader W, Navarro T, Parrish R, Cotoi C, Germini F, Linkins LA, Iorio A, Haynes B, Lokker C

A Deep Learning Approach to Refine the Identification of High-Quality Clinical Research Articles From the Biomedical Literature: Protocol for Algorithm Development and Validation

JMIR Res Protoc 2021;10(11):e29398

DOI: 10.2196/29398

PMID: 34847061

PMCID: 8669577

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Auto Machine Learning Approaches to Refine the Identification of High-Quality Clinical Research Articles from The Biomedical Literature: A Study Protocol

  • Wael Abdelkader; 
  • Tamara Navarro; 
  • Rick Parrish; 
  • Chris Cotoi; 
  • Federico Germini; 
  • Lori-Ann Linkins; 
  • Alfonso Iorio; 
  • Brian Haynes; 
  • Cynthia Lokker

ABSTRACT

Due to the continued rapid growth in published biomedical literature, it is increasingly difficult to identify and retrieve high-quality evidence. Machine learning approaches have been applied to address this issue. Some models developed using supervised machine learning approaches have achieved high sensitivity or recall, however precision has been variable. In a series of experiments, we will assess the performance of machine learning models to retrieve high-quality, high relevance evidence for clinical consideration from the biomedical literature. The models will be trained using an automated approach applied to a database of almost 100, 000 articles that have been tagged by highly trained research staff based on criteria for high-quality and assessed for clinical relevance by clinicians. We will evaluate and report on the effects of various classifiers, preprocessing steps, feature selection, and the use of balanced vs unbalanced datasets applied during model development on the performance of the derived supervised machine learning models. The series was devised to improve the precision of the retrieval of high-quality articles by applying a machine learning classifier sequentially after using high sensitivity Boolean search filters to an ongoing literature surveillance process. Our multi-level analysis of the various steps of machine learning model development will help expand the existing knowledge base on the effect of each step on the performance of machine learning models.


 Citation

Please cite as:

Abdelkader W, Navarro T, Parrish R, Cotoi C, Germini F, Linkins LA, Iorio A, Haynes B, Lokker C

A Deep Learning Approach to Refine the Identification of High-Quality Clinical Research Articles From the Biomedical Literature: Protocol for Algorithm Development and Validation

JMIR Res Protoc 2021;10(11):e29398

DOI: 10.2196/29398

PMID: 34847061

PMCID: 8669577

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.