Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Sep 3, 2024
Date Accepted: Dec 4, 2024

The final, peer-reviewed published version of this preprint can be found here:

Forecasting the Incidence of Mumps Based on the Baidu Index and Environmental Data in Yunnan, China: Deep Learning Model Study

Xiong X, Xiang L, Chang L, Wu IX, Deng S

Forecasting the Incidence of Mumps Based on the Baidu Index and Environmental Data in Yunnan, China: Deep Learning Model Study

J Med Internet Res 2025;27:e66072

DOI: 10.2196/66072

PMID: 39913179

PMCID: 11843052

Forecasting the Incidence of Mumps based on Baidu index and Environmental Data in Yunnan, China: Deep Learning Model Study

  • Xin Xiong; 
  • Linghui Xiang; 
  • Litao Chang; 
  • Irene Xinyin Wu; 
  • Shuzhen Deng

ABSTRACT

Background:

Mumps is a viral respiratory disease characterized by facial swelling and transmitted through respiratory secretions. Despite the availability of an effective vaccine, mumps outbreaks have reemerged globally, including in China, where it remains a significant public health issue. In Yunnan province, China, the incidence of mumps has fluctuated markedly, underscoring the need for improved outbreak prediction methods. Traditional surveillance methods, however, may not be sufficient for timely and accurate outbreak prediction.

Objective:

Our study aims to leverage the Baidu search index, representing search volumes from China's most popular search engine, along with environmental data to develop a predictive model for mumps incidence in Yunnan province.

Methods:

We analyzed the incidence of mumps in Yunnan Province from 2014 to 2023, and used time series data, including mumps incidence, Baidu search index, and environmental factors, from 2016 to 2023, to develop predictive models based on long short-term memory networks. Feature selection was conducted using Pearson correlation analysis, and lag correlations were explored through a distributed nonlinear lag model (DNLM). We constructed four models with different combinations of predictors: (1) model BE, combining Baidu index and environmental factors data; (2) model IB, combining mumps incidence and Baidu index data; (3) model IE, combining mumps incidence and environmental factors; and (4) model IBE, integrating all three data sources.

Results:

The incidence of mumps in Yunnan showed significant variability, peaking at 37.5 per 100,000 in 2019. From 2014 to 2023, the proportion of female patients ranged from 41.3% in 2015 to 45.7% in 2020, consistently lower than that of males. After excluding variable with Pearson correlation coefficient (PCC) <0.10 or p values <0.05, we included three Baidu index of search term groups (disease name, symptoms and treatment) and six environmental factors (maximum temperature, minimum temperature, SO2, CO, PM 2.5 and PM 10) for model development. DNLM analysis revealed that the relative risks consistently increased with rising Baidu index values, while non-linear associations between temperature and mumps incidence were observed. Among the four models, model IBE exhibited the best performance, achieving the coefficient of determination of 0.72, with mean absolute error, mean absolute percentage error, and root mean square error values of 0.33, 15.9%, and 0.43, respectively, in the test set.

Conclusions:

Our study developed the model IBE to predict the incidence of mumps in Yunnan province, offering a potential tool for early detection of mumps outbreaks. The performance of model IBE underscores the potential of integrating search engine data and environmental factors to enhance mumps incidence forecasting. This approach offers a promising tool for improving public health surveillance and enabling rapid responses to mumps outbreaks in regions with variable disease patterns like Yunnan.


 Citation

Please cite as:

Xiong X, Xiang L, Chang L, Wu IX, Deng S

Forecasting the Incidence of Mumps Based on the Baidu Index and Environmental Data in Yunnan, China: Deep Learning Model Study

J Med Internet Res 2025;27:e66072

DOI: 10.2196/66072

PMID: 39913179

PMCID: 11843052

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.