Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Previously submitted to: Journal of Medical Internet Research (no longer under consideration since Jan 06, 2023)

Date Submitted: Apr 27, 2022

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Digital Health Discussion Through Articles Published Until the Year 2021: A Digital Topic Modeling Approach

  • Junhyoun Sung; 
  • Hyoungsook Kim

ABSTRACT

Background:

Since the 2010s, the digital health industry has grown significantly, gaining popularity with the public. The term “Digital Health” is being explored in various academic fields, such as public health, medicine, and computer science.

Objective:

This study analyzes the research trends of digital health related articles published in the Web of Science until 2021 to understand the research concentration, boundary, scope, and characteristics.

Methods:

By crawling and preprocessing 27,638 digital health-related papers provided by Web of Science and investigating 15,950 of them, the number of articles published by year and by field are compared and analyzed. Since these 15,950 papers belong to the top 10 academic fields, they were regrouped into three major fields: public health, medicine, and electrical engineering and computer science (EECS). Latent Dirichlet Allocation (LDA) is applied as a topic modeling method for each field and time period. The number of topics is determined based on the coherence score.

Results:

The number of optimal topics in the first and second halves for public health were 13 and 19, for medicine, 14 and 25, and for EECS, 7 and 21, respectively. Text analysis showed that articles from public health, medicine, and EECS share similar topics but vary in composition. The homogeneity test showed that the contrast between each group is significant (p<2.2e-16). All the topics revealed in articles could be categorized into six dominant themes; journal article methodology, information technology, medical issues, subject, social phenomenon, and healthcare. As a result of the LDA analysis, the topics of each domain differed, and the composition of each theme was different between academic fields and time periods. Studies on public health focused on social phenomena, prevention, and daily care, while studies in medicine investigated treatment and cure issues in the second half. Studies in EECS highlighted the importance of technical issues, while showing a comparatively distant relation to public health or medicine. All fields emphasized information technology (IT) in the first half, and each domain published specialized articles in the second half. In particular, there were numerous articles belonging to both public health and medicine, while only a few were common with EECS.

Conclusions:

The articles belonging to each domain became more specialized and distinguished from other domains and all three fields highlighted social phenomena and healthcare over time. With Covid-19 becoming a dominant issue recently, digital health has come to be strongly related to depression and mental disorders, education, and physical activity with articles on these topics appearing in the second half in all fields. The scope of digital health research is expanding and its composition fluctuating. In the future, it will be necessary to explore papers on expanded topics that reflect people's needs for digital health.


 Citation

Please cite as:

Sung J, Kim H

Digital Health Discussion Through Articles Published Until the Year 2021: A Digital Topic Modeling Approach

JMIR Preprints. 27/04/2022:39027

DOI: 10.2196/preprints.39027

URL: https://preprints.jmir.org/preprint/39027

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.