Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Formative Research

Date Submitted: Jan 14, 2025
Date Accepted: Jan 28, 2026

The final, peer-reviewed published version of this preprint can be found here:

Personalized Glucose Management With AI: Pilot Study Using a Multiarmed Bandit Approach

Hotta S, Kytö M, Koivusalo S, Heinonen S, Marttinen P

Personalized Glucose Management With AI: Pilot Study Using a Multiarmed Bandit Approach

JMIR Form Res 2026;10:e70826

DOI: 10.2196/70826

PMID: 41855477

Personalized glucose management with AI: A pilot study using a multi-armed bandit approach

  • Shinji Hotta; 
  • Mikko Kytö; 
  • Saila Koivusalo; 
  • Seppo Heinonen; 
  • Pekka Marttinen

ABSTRACT

Background:

Personalized behavioral recommendations through mobile applications have proven effective in preventing serious chronic diseases such as diabetes. Recent studies have primarily focused on the personally optimizing recommendations using reinforcement learning. However, the main problem with these approaches is that they focus on behavior changes and overlook clinical outcomes.

Objective:

In this study, we proposed a method for online planning of dietary and exercise recommendations to directly optimize postprandial glucose levels through behavioral changes.

Methods:

The proposed method is a multi-arm bandit based on a two-stage reward prediction model, wherein an action is a combination of the total carbohydrate intake and postprandial walking duration, and the reward is how low the postprandial glucose levels are. We realized predicting the reward for each action based on the predicted behavioral responses to an action, and subsequently, the postprandial glycemic response, using the predicted responses.

Results:

In a simulation experiment, we demonstrated that the proposed online algorithm can significantly improve postprandial glucose levels with personalized recommendations, compared to the randomized policy. Furthermore, we conducted a small real-world experiment with a simplified proposed method involving a single update of the recommendation policy into a personalized one. For six human subjects, compared to the randomized policy, we confirmed 23% improvement (on average) in actual glucose responses along with the behavioral adherence to the recommendations concerning carbohydrate intake and postprandial walking.

Conclusions:

The effectiveness of the proposed method was demonstrated from both the simulation experiment and the small real-world experiment. However, further longitudinal real-world experiment in patients with diabetes is needed to confirm the finding.


 Citation

Please cite as:

Hotta S, Kytö M, Koivusalo S, Heinonen S, Marttinen P

Personalized Glucose Management With AI: Pilot Study Using a Multiarmed Bandit Approach

JMIR Form Res 2026;10:e70826

DOI: 10.2196/70826

PMID: 41855477

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.