Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: May 20, 2020
Date Accepted: Sep 13, 2020
Date Submitted to PubMed: Sep 16, 2020

The final, peer-reviewed published version of this preprint can be found here:

Identification of Risk Factors and Symptoms of COVID-19: Analysis of Biomedical Literature and Social Media Data

Jeon J, Baruah G, Sarabadani S, Palanica A

Identification of Risk Factors and Symptoms of COVID-19: Analysis of Biomedical Literature and Social Media Data

J Med Internet Res 2020;22(10):e20509

DOI: 10.2196/20509

PMID: 32936770

PMCID: 7537723

Identification of risk factors and symptoms of SARS-CoV-2 (COVID-19) using biomedical literature and social media data: Integrative and Consensus study

  • Jouhyun Jeon; 
  • Gaurav Baruah; 
  • Sarah Sarabadani; 
  • Adam Palanica

ABSTRACT

Background:

In December 2019, Coronavirus disease 2019 (COVID-19) outbreak started in China and rapidly spread around the world. Lack of any vaccine or optimized intervention raised the importance of characterizing risk factors and symptoms for the early identification and successful treatment of COVID-19 patients.

Objective:

This study aims to investigate and analyze biomedical literature and public social media data to understand the association of risk factors and symptoms with various outcomes of COVID-19 patients.

Methods:

Through semantic analysis, we collected 45 retrospective cohort studies, which evaluated 303 clinical and demographic variables across 13 different outcomes of COVID-19 patients, and 84,140 Twitter posts from 1,036 COVID-19 positive users. Machine-learning tools to extract biomedical information were introduced to identify uncommon or novel symptoms mentioning in social media. We then examined and compared two datasets to expand our landscape of risk factors and symptoms related to COVID-19.

Results:

From the biomedical literature, approximately 90% of clinical and demographic variables showed inconsistent associations with COVID-19 outcomes. Consensus analysis identified 72 risk factors that were specifically associated with individual outcomes. From the social media data, 51 symptoms were characterized and analyzed. By comparing social media data with biomedical literature, we identified 25 novel symptoms that were specifically mentioned in social media but have been not previously well characterized. Furthermore, there were certain combinations of symptoms that were frequently mentioned together in social media.

Conclusions:

Identified outcome-specific risk factors, symptoms, and combinations of symptoms may serve as surrogate indicators to identify COVID-19 patients and predict their clinical outcomes providing appropriate treatments. Clinical Trial: NA


 Citation

Please cite as:

Jeon J, Baruah G, Sarabadani S, Palanica A

Identification of Risk Factors and Symptoms of COVID-19: Analysis of Biomedical Literature and Social Media Data

J Med Internet Res 2020;22(10):e20509

DOI: 10.2196/20509

PMID: 32936770

PMCID: 7537723

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.