Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: May 20, 2020
Date Accepted: Sep 13, 2020
Date Submitted to PubMed: Sep 16, 2020
Identification of risk factors and symptoms of SARS-CoV-2 (COVID-19) using biomedical literature and social media data: Integrative and Consensus study
ABSTRACT
Background:
In December 2019, Coronavirus disease 2019 (COVID-19) outbreak started in China and rapidly spread around the world. Lack of any vaccine or optimized intervention raised the importance of characterizing risk factors and symptoms for the early identification and successful treatment of COVID-19 patients.
Objective:
This study aims to investigate and analyze biomedical literature and public social media data to understand the association of risk factors and symptoms with various outcomes of COVID-19 patients.
Methods:
Through semantic analysis, we collected 45 retrospective cohort studies, which evaluated 303 clinical and demographic variables across 13 different outcomes of COVID-19 patients, and 84,140 Twitter posts from 1,036 COVID-19 positive users. Machine-learning tools to extract biomedical information were introduced to identify uncommon or novel symptoms mentioning in social media. We then examined and compared two datasets to expand our landscape of risk factors and symptoms related to COVID-19.
Results:
From the biomedical literature, approximately 90% of clinical and demographic variables showed inconsistent associations with COVID-19 outcomes. Consensus analysis identified 72 risk factors that were specifically associated with individual outcomes. From the social media data, 51 symptoms were characterized and analyzed. By comparing social media data with biomedical literature, we identified 25 novel symptoms that were specifically mentioned in social media but have been not previously well characterized. Furthermore, there were certain combinations of symptoms that were frequently mentioned together in social media.
Conclusions:
Identified outcome-specific risk factors, symptoms, and combinations of symptoms may serve as surrogate indicators to identify COVID-19 patients and predict their clinical outcomes providing appropriate treatments. Clinical Trial: NA
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.