Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Mar 5, 2020
Date Accepted: Apr 16, 2020

The final, peer-reviewed published version of this preprint can be found here:

Healthcare Research and Analytics Data Infrastructure Solution: A Data Warehouse for Health Services Research

Ozaydin B, Zengul F, Oner N, Feldman SS

Healthcare Research and Analytics Data Infrastructure Solution: A Data Warehouse for Health Services Research

J Med Internet Res 2020;22(6):e18579

DOI: 10.2196/18579

PMID: 32496199

PMCID: 7303827

Healthcare Research and Analytics Data Infrastructure Solution: A Data Warehouse for Health Services Research

  • Bunyamin Ozaydin; 
  • Ferhat Zengul; 
  • Nurettin Oner; 
  • Sue S Feldman

Background:

Health services researchers spend a substantial amount of time performing integration, cleansing, interpretation, and aggregation of raw data from multiple public or private data sources. Often, each researcher (or someone in their team) duplicates this effort for their own project, facing the same challenges and experiencing the same pitfalls discovered by those before them.

Objective:

This paper described a design process for creating a data warehouse that includes the most frequently used databases in health services research.

Methods:

The design is based on a conceptual iterative process model framework that utilizes the sociotechnical systems theory approach and includes the capacity for subsequent updates of the existing data sources and the addition of new ones. We introduce the theory and the framework and then explain how they are used to inform the methodology of this study.

Results:

The application of the iterative process model to the design research process of problem identification and solution design for the Healthcare Research and Analytics Data Infrastructure Solution (HRADIS) is described. Each phase of the iterative model produced end products to inform the implementation of HRADIS. The analysis phase produced the problem statement and requirements documents. The projection phase produced a list of tasks and goals for the ideal system. Finally, the synthesis phase provided the process for a plan to implement HRADIS. HRADIS structures and integrates data dictionaries provided by the data sources, allowing the creation of dimensions and measures for a multidimensional business intelligence system. We discuss how HRADIS is complemented with a set of data mining, analytics, and visualization tools to enable researchers to more efficiently apply multiple methods to a given research project. HRADIS also includes a built-in security and account management framework for data governance purposes to ensure customized authorization depending on user roles and parts of the data the roles are authorized to access.

Conclusions:

To address existing inefficiencies during the obtaining, extracting, preprocessing, cleansing, and filtering stages of data processing in health services research, we envision HRADIS as a full-service data warehouse integrating frequently used data sources, processes, and methods along with a variety of data analytics and visualization tools. This paper presents the application of the iterative process model to build such a solution. It also includes a discussion on several prominent issues, lessons learned, reflections and recommendations, and future considerations, as this model was applied.

Clinicaltrial:


 Citation

Please cite as:

Ozaydin B, Zengul F, Oner N, Feldman SS

Healthcare Research and Analytics Data Infrastructure Solution: A Data Warehouse for Health Services Research

J Med Internet Res 2020;22(6):e18579

DOI: 10.2196/18579

PMID: 32496199

PMCID: 7303827

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.