Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Previously submitted to: Journal of Medical Internet Research (no longer under consideration since Oct 07, 2021)

Date Submitted: Aug 18, 2020

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

RAPIDS: Reproducible Analysis Pipeline for Data Streams Collected with Mobile Devices

  • Julio Vega; 
  • Meng Li; 
  • Kwesi Aguillera; 
  • Nikunj Goel; 
  • Echhit Joshi; 
  • Krina C Durica; 
  • Abhineeth Reddy Kunta; 
  • Carissa A Low

ABSTRACT

Background:

Smartphone and wearable devices are widely used in behavioral and clinical research to collect longitudinal data that, along with ground truth data, are used to create models of human behavior. Mobile sensing researchers often program analysis code from scratch even though many research teams collect data from similar mobile sensors, platforms and devices. As a result, the quality of code varies, code is often not shared alongside publications, and when it is, it might not be stored on a version control system and most of the time there is no guarantee the development environment can be replicated. This makes it difficult for other scientists to read, reuse, audit, and reproduce a publication’s code and its results.

Objective:

We present RAPIDS, a reproducible pipeline to standardize the preprocessing, feature extraction, analysis, visualization, and reporting of data streams coming from mobile sensors.

Methods:

RAPIDS is formed by a group of R and Python scripts that are executed on top of reproducible virtual environments, orchestrated by Snakemake and organized following the cookiecutter data science project. Its development has been and will be informed by public discussions with the mobile sensing research community.

Results:

We share open source, documented, extensible and tested code to preprocess and extract behavioral features from data collected with the AWARE Framework in Android and iOS smartphones as well as Fitbit devices. We also provide a file structure and development environment that other researchers can follow to publish their own models, visualizations, and reports.

Conclusions:

RAPIDS allows researchers to process mobile sensor data in a rigorous and reproducible way. This saves time and effort during the data analysis phase of a project and makes it easier to share an analysis workflow alongside publications.


 Citation

Please cite as:

Vega J, Li M, Aguillera K, Goel N, Joshi E, Durica KC, Kunta AR, Low CA

RAPIDS: Reproducible Analysis Pipeline for Data Streams Collected with Mobile Devices

JMIR Preprints. 18/08/2020:23246

DOI: 10.2196/preprints.23246

URL: https://preprints.jmir.org/preprint/23246

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.