Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Diabetes

Date Submitted: Nov 24, 2018
Date Accepted: Jul 19, 2019

The final, peer-reviewed published version of this preprint can be found here:

A Reinforcement Learning–Based Method for Management of Type 1 Diabetes: Exploratory Study

Oroojeni Mohammad Javad M, Agboola SO, Jethwani K, Zeid A, Kamarthi S

A Reinforcement Learning–Based Method for Management of Type 1 Diabetes: Exploratory Study

JMIR Diabetes 2019;4(3):e12905

DOI: 10.2196/12905

PMID: 31464196

PMCID: 6737889

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

A Reinforcement Learning–Based Method for Management of Type 1 Diabetes: Exploratory Study

  • Mahsa Oroojeni Mohammad Javad; 
  • Stephen Olusegun Agboola; 
  • Kamal Jethwani; 
  • Abe Zeid; 
  • Sagar Kamarthi

Background:

Type 1 diabetes mellitus (T1DM) is characterized by chronic insulin deficiency and consequent hyperglycemia. Patients with T1DM require long-term exogenous insulin therapy to regulate blood glucose levels and prevent the long-term complications of the disease. Currently, there are no effective algorithms that consider the unique characteristics of T1DM patients to automatically recommend personalized insulin dosage levels.

Objective:

The objective of this study was to develop and validate a general reinforcement learning (RL) framework for the personalized treatment of T1DM using clinical data.

Methods:

This research presents a model-free data-driven RL algorithm, namely Q-learning, that recommends insulin doses to regulate the blood glucose level of a T1DM patient, considering his or her state defined by glycated hemoglobin (HbA1c) levels, body mass index, engagement in physical activity, and alcohol usage. In this approach, the RL agent identifies the different states of the patient by exploring the patient’s responses when he or she is subjected to varying insulin doses. On the basis of the result of a treatment action at time step t, the RL agent receives a numeric reward, positive or negative. The reward is calculated as a function of the difference between the actual blood glucose level achieved in response to the insulin dose and the targeted HbA1c level. The RL agent was trained on 10 years of clinical data of patients treated at the Mass General Hospital.

Results:

A total of 87 patients were included in the training set. The mean age of these patients was 53 years, 59% (51/87) were male, 86% (75/87) were white, and 47% (41/87) were married. The performance of the RL agent was evaluated on 60 test cases. RL agent–recommended insulin dosage interval includes the actual dose prescribed by the physician in 53 out of 60 cases (53/60, 88%).

Conclusions:

This exploratory study demonstrates that an RL algorithm can be used to recommend personalized insulin doses to achieve adequate glycemic control in patients with T1DM. However, further investigation in a larger sample of patients is needed to confirm these findings.


 Citation

Please cite as:

Oroojeni Mohammad Javad M, Agboola SO, Jethwani K, Zeid A, Kamarthi S

A Reinforcement Learning–Based Method for Management of Type 1 Diabetes: Exploratory Study

JMIR Diabetes 2019;4(3):e12905

DOI: 10.2196/12905

PMID: 31464196

PMCID: 6737889

Per the author's request the PDF is not available.