JMIR Preprints #85475: Preliminary Evaluation of an LLM-Powered Chatbot for Osteoporosis Self-Management Education: A Formative Randomized Controlled Trial

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Preliminary Evaluation of an LLM-Powered Chatbot for Osteoporosis Self-Management Education: A Formative Randomized Controlled Trial

Jinling Huang;
Xiaolian Xin;
Chunyan He;
HaoYan Xiong;
Wenjun Pang;
Shaobo Pang;
Wei Sun;
Xianghua Ding

ABSTRACT

Background:

With the increasing burden of chronic diseases, self-management education (SME) is crucial. Traditional SME based on face-to-face oral delivery by clinicians is resource-intensive, and general digital tools, such as web-based platforms, often provide limited interactivity for patient learning. While chatbots based on large language models (LLMs) show promise in usability and interactivity, their real-world effectiveness still lacks empirical evidence.

Objective:

This study aimed to explore the feasibility and preliminary effectiveness of an LLM-based chatbot specifically designed for osteoporosis SME.

Methods:

A formative randomized controlled trial was conducted in a tertiary hospital from February 2024 to March 2025. Adults aged ≥18 years with osteoporosis were recruited and randomly assigned (1:1) to either the intervention (OPBot) group or a control group receiving traditional health education. The chatbot provided interactive educational content and Q&A support, while the control group received face-to-face education and written materials. Osteoporosis knowledge was assessed using the Osteoporosis Knowledge Assessment Tool at baseline and discharge. Nurses’ time spent on health education was self-recorded during each intervention session and aggregated across sessions. Adherence to disease management was assessed at 1, 3, and 6 months post-discharge via telephone using structured Likert-scale questionnaires. The reliability of OPBot responses was evaluated by two clinicians using a 5-point Likert scale, with inter-rater agreement calculated using Cohen’s kappa. Group comparisons were conducted using t-tests, Mann–Whitney U tests, and chi-square tests, and adherence outcomes were analyzed using mixed-effects models.

Results:

A total of 100 participants were randomized; 12 were excluded due to loss to follow-up, refusal of the second knowledge assessment, or death, leaving 88 participants for analysis (45 in the OPBot group and 43 in the control group). The OPBot group showed significantly higher post-intervention knowledge scores than the control group (median [IQR]: 80.0 [70.0–89.0] vs 75.0 [65.5–80.0]; P = .01). Nurses in the OPBot group spent less time on SME than those in the control group (median [IQR]: 5.0 [2.0–17.0] vs 23.0 [20.0–25.0] minutes; P < .001). For adherence outcomes, a significant group × time interaction was observed for calcium supplement intake (OR 1.49, 95% CI 1.08–2.06; P = .02), indicating differing adherence trajectories over time. The OPBot group also showed higher odds of consuming calcium-rich foods across time points (OR 2.87, 95% CI 1.04–7.89; nominal P = .04), although this association did not remain significant after Holm correction for multiple comparisons. No significant effects were observed for sun exposure, exercise, or total adherence scores. In the Q&A module, most OPBot responses were rated as highly reliable (89.4%), with almost perfect inter-rater agreement (Cohen’s κ = 0.83).

Conclusions:

LLM-based chatbots specifically designed for osteoporosis SME may be a feasible approach to improving patient knowledge, supporting adherence behaviors, and reducing healthcare workload. However, further large-scale studies are needed to confirm these findings.

Citation

Please cite as:

Huang J, Xin X, He C, Xiong H, Pang W, Pang S, Sun W, Ding X

Preliminary Evaluation of a Large Language Model–Powered Chatbot for Osteoporosis Self-Management Education: Formative Randomized Controlled Trial

JMIR Form Res 2026;10:e85475

DOI: 10.2196/85475

PMID: 42228937

PMCID: 13273208

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Formative Research

Date Submitted: Oct 8, 2025

Open Peer Review Period: Oct 8, 2025 - Dec 3, 2025

Date Accepted: May 12, 2026

(closed for review but you can still tweet)

Preliminary Evaluation of an LLM-Powered Chatbot for Osteoporosis Self-Management Education: A Formative Randomized Controlled Trial

ABSTRACT

Citation

Copyright