Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Public Health and Surveillance

Date Submitted: Nov 20, 2018
Open Peer Review Period: Nov 21, 2018 - Dec 5, 2018
Date Accepted: Apr 3, 2020
Date Submitted to PubMed: Jul 23, 2020
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

An Automated Approach for Finding Spatio-Temporal Patterns of Seasonal Influenza in the United States: Algorithm Validation Study

Sambaturu P, Bhattacharya P, Chen J, Lewis B, Marathe M, Venkataramanan S, Vullikanti A

An Automated Approach for Finding Spatio-Temporal Patterns of Seasonal Influenza in the United States: Algorithm Validation Study

JMIR Public Health Surveill 2020;6(3):e12842

DOI: 10.2196/12842

PMID: 32701458

PMCID: 7501584

An automated approach for finding spatio-temporal patterns in disease spread

  • Prathyush Sambaturu; 
  • Parantapa Bhattacharya; 
  • Jiangzhuo Chen; 
  • Bryan Lewis; 
  • Madhav Marathe; 
  • Srinivasan Venkataramanan; 
  • Anil Vullikanti

ABSTRACT

Background:

Agencies such as the Centers for Disease Control (CDC) currently release incidence data (e.g., Influenza), along with descriptive summaries of simple spatio-temporal patterns and trends. However, public health researchers, government agencies, as well as the general public, are often interested in deeper patterns and insights into how the disease is spreading. Analysis by domain experts is needed for deriving such insights from incidence data.

Objective:

Our goal is to develop an automated approach for finding interesting spatio-temporal patterns in the spread of a disease over a large region, such as: regions which have specific characteristics, e.g., high incidence in a particular week, those which showed a sudden change in incidence, or regions which have significantly different incidence compared to earlier seasons.

Methods:

We develop techniques from the area of transactional data mining for characterizing and finding interesting spatio-temporal patterns in disease spread in an automated manner. A key part of our approach involves using the principle of minimum description length for characterizing a set of regions in terms of combinations of attributes; we use integer programming to find such descriptions. Our automated approach explores regions that have different kinds of temporal patterns, and ranks them based on their description length.

Results:

We apply our methods for finding spatio-temporal patterns in the spread of seasonal Influenza in the US at the resolution of states. We find succinct descriptions for regions (sets of states) with specific characteristics, e.g., high activity level, which give better insight into such regions. Our approach also finds interesting patterns in the form of regions exhibiting significant changes in activity levels in a short time, and in terms of activity levels in the past seasons.

Conclusions:

Our approach can provide new insights into the patterns and trends in disease spread in an automated manner. Our results show that the description complexity is an effective approach for characterizing sets of interest, which can be easily extended to other diseases and regions, beyond Influenza in the US. The patterns we find have a specific structure, which can be easily adapted for automated generation of narratives.


 Citation

Please cite as:

Sambaturu P, Bhattacharya P, Chen J, Lewis B, Marathe M, Venkataramanan S, Vullikanti A

An Automated Approach for Finding Spatio-Temporal Patterns of Seasonal Influenza in the United States: Algorithm Validation Study

JMIR Public Health Surveill 2020;6(3):e12842

DOI: 10.2196/12842

PMID: 32701458

PMCID: 7501584

Per the author's request the PDF is not available.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.