Accepted for/Published in: JMIR Public Health and Surveillance
Date Submitted: Apr 29, 2020
Open Peer Review Period: Apr 29, 2020 - May 4, 2020
Date Accepted: May 13, 2020
Date Submitted to PubMed: May 15, 2020
(closed for review but you can still tweet)
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Correlations between Real-World COVID-19 Incidence and Search Engine Queries in Google and Baidu: A Time-Series Analysis
ABSTRACT
Background:
The COVID-19 pandemic has the distinction of being the first pandemic to occur in the digital age. With the Internet harvesting large amounts of data from the general population in real-time, public databases such as Google Trends and the Baidu Index can be an expedient tool to assist public health efforts.
Objective:
To apply digital epidemiology to the current COVID-19 pandemic to determine utility in the providing adjunctive epidemiologic information on outbreaks of this disease and evaluate this methodology in the case of future pandemics.
Methods:
An epidemiologic time-series analysis of online search trends relating to the COVID-19 pandemic was performed from January 9, 2020 to April 6, 2020. These data were compared to real-world confirmed cases and deaths of COVID-19. Chronologic patterns were assessed in relation to disease patterns, significant events, and media reports.
Results:
Worldwide search terms for shortness of breath, anosmia, dysgeusia/ageusia, headache, chest pain, and sneezing had strong correlations (r>.6, P<.001) to both new daily confirmed cases and deaths from COVID-19. COVID-19 searches predated RW confirmed cases by 12 days (r=.85±.10, P<.001). Searches for symptoms of diarrhea, fever, shortness of breath, cough, nasal obstruction, and rhinorrhea all had a negative lag of greater than one week compared to new daily cases; whereas, searches for anosmia and dysgeusia peaked worldwide and in China with positive lags of 5 days and 6 weeks, respectively, corresponding with widespread media coverage of these symptoms in COVID-19.
Conclusions:
This study demonstrates the utility of digital epidemiology in providing helpful surveillance data of disease outbreaks like COVID-19. While certain online search trends for this disease were influenced by media coverage, many search terms reflected clinical manifestations of the disease and showed strong correlations with real-world cases and deaths.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.