Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Mar 17, 2021
Date Accepted: May 1, 2021
Date Submitted to PubMed: Aug 12, 2021

The final, peer-reviewed published version of this preprint can be found here:

Forecasting the COVID-19 Epidemic by Integrating Symptom Search Behavior Into Predictive Models: Infoveillance Study

Rabiolo A, Alladio E, Morales E, McNaught AI, Bandello F, Afifi AA, Marchese A

Forecasting the COVID-19 Epidemic by Integrating Symptom Search Behavior Into Predictive Models: Infoveillance Study

J Med Internet Res 2021;23(8):e28876

DOI: 10.2196/28876

PMID: 34156966

PMCID: 8360333

Forecasting the COVID-19 epidemic integrating symptom search behavior: an infoveillance study

  • Alessandro Rabiolo; 
  • Eugenio Alladio; 
  • Esteban Morales; 
  • Andrew Ian McNaught; 
  • Francesco Bandello; 
  • Abdelmonem A Afifi; 
  • Alessandro Marchese

ABSTRACT

Background:

Previous studies have suggested associations between trends of web searches and COVID-19 traditional metrics. It remains unclear whether models incorporating trends of digital searches lead to better predictions.

Objective:

To investigated the relationship between Google Trends searches of symptoms associated with COVID-19 and confirmed COVID-19 cases and deaths. To develop predictive models to forecast COVID-19 epidemic based on the combination of Google Trends searches of symptoms and conventional COVID-19 metrics.

Methods:

An open-access web application was developed to evaluate Google Trends and traditional COVID-19 metrics via an interactive framework based on principal components analysis (PCA) and time series modelling. The app facilitates the analysis of symptom search behavior associated with COVID-19 disease in 188 countries. In this study, we selected data of eight countries as case studies to represent all continents. PCA was used to perform data dimensionality reduction, and three different time series models (Error Trend Seasonality, Autoregressive integrated moving average, and feed-forward neural network autoregression) were used to predict COVID-19 metrics in the upcoming 14 days. The models were compared in terms of prediction ability using the root-mean-square error (RMSE) of the first principal component (PC1). Predictive ability of models generated with both Google Trends data and conventional COVID-19 metrics were compared with those fitted with conventional COVID-19 metrics only.

Results:

The degree of correlation and the best time-lag varied as a function of the selected country and topic searched; in general, the optimal time-lag was within 15 days. Overall, predictions of PC1 based on both searched termed and COVID-19 traditional metrics performed better than those not including Google searches (median [IQR]: 1.43 [0.74-2.36] vs. 1.78 [0.95-2.88], respectively), but the improvement in prediction varied as a function of the selected country and timeframe. The best model varied as a function of country, time range, and period of time selected. Models based on a 7-day moving average led to considerably smaller RMSE values as opposed to those calculated with raw data (median [IQR]: 0.74 [0.47-1.22] vs. 2.15 [1.55-3.89], respectively).

Conclusions:

The inclusion of digital online searches in statistical models may improve the nowcasting and forecasting of COVID-19 epidemic, and could be used as one of the surveillance systems of COVID-19 disease. We provide a free web-application operating with nearly real-time data that can be used by any user to make predictions of outbreaks, improve estimates of dynamics of ongoing epidemics, and anticipate future or rebound waves.


 Citation

Please cite as:

Rabiolo A, Alladio E, Morales E, McNaught AI, Bandello F, Afifi AA, Marchese A

Forecasting the COVID-19 Epidemic by Integrating Symptom Search Behavior Into Predictive Models: Infoveillance Study

J Med Internet Res 2021;23(8):e28876

DOI: 10.2196/28876

PMID: 34156966

PMCID: 8360333

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.