Accepted for/Published in: JMIR Formative Research
Date Submitted: Feb 8, 2021
Date Accepted: Dec 14, 2021
Prediction of Age-Adjusted Mortality From Stroke in Japanese Prefectures by Search Engine Query: An Ecological Study
ABSTRACT
Background:
Stroke is a major cause of death and nursing care in Japan, and regional disparities are large.
Objective:
The purpose of this study was to clarify the association between stroke-related information retrieval behavior and age-adjusted mortality in each prefecture in Japan.
Methods:
Age-adjusted mortality from stroke and aging rates were obtained from publicly available Japanese government statistics. A total of 9476 abstracts of Japanese articles related to symptoms and signs of stroke were identified in Ichushi-Web, a Japanese online database of biomedical articles, and 100 highly frequent words (Stroke 100) were extracted. Using data from 2014 to 2019, a random forest analysis was carried out using the age-adjusted mortality from stroke of 47 prefectures as the outcome variable and the standardized retrieval numbers of the Stroke 100 words in the log data of Yahoo! JAPAN Search as predictive variables. Regression analysis was performed using a generalized linear mixed model (GLMM) with the number of standardized searches for Stroke 100 words with high importance scores in the random forest model as the predictive variable. In the regression analysis with GLMM, the aging rate and data year were used as control variables, and the random slope of data year and random intercept were calculated by prefecture.
Results:
The mean age-adjusted mortality from stroke was 28.07 ± 4.55 for all prefectures in all data years. The accuracy score of the random forest analysis was 89.94%, the average error was 2.79 degrees, and the mean squared error was 13.57. Nine variables with high importance scores in the random forest were selected as predictive variables of the regression analysis: male, age, hospitalization, enforcement, progress, stroke, abnormal, use, and change. As a result of the regression analysis with GLMM, the standardized partial regression coefficients (β) and 95% confidence intervals (95% CI) showed that the following internet search terms were significantly associated with age-adjusted mortality from stroke: male (β=−5.83, 95% CI=−8.67 to −3.29), age (β=−5.83, 95% CI=−8.67 to −3.29), hospitalization (β=−5.83, 95% CI=−8.67 to −3.29), and abnormal (β=3.83, 95% CI=1.14 to 6.56).
Conclusions:
Stroke-related search behavior was associated with age-adjusted mortality from stroke in each prefecture in Japan. Query terms that were strongly associated with age-adjusted mortality rates of stroke suggested the possibility that individual characteristics such as sex and age have an impact on stroke-associated mortality and that it is important to receive medical care early after stroke onset. Further studies on the criteria and timing of alerting are needed by monitoring information-seeking behavior to identify queries that are strongly associated with stroke mortality.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.