Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Oct 20, 2018
Open Peer Review Period: Oct 25, 2018 - Nov 26, 2018
Date Accepted: Jan 21, 2019
(closed for review but you can still tweet)
Medication Adherence Prediction from Online Social Forums: A Machine Learning Model for Fibromyalgia.
ABSTRACT
Background:
Medication non-adherence can compound into severe medical problems for patients. Identifying patients that are likely to become non-adherent may help reduce these problems. Data-driven machine learning models can predict medication adherence by using selected indicators from patients’ past health records. Sources of data for these models traditionally fall under two main categories: a) proprietary data from insurance claims, pharmacy prescriptions, or electronic medical records and b) survey data collected from representative groups of patients. Models developed using these data sources often are limited because they are: proprietary, subject to high cost, have limited scalability, or lack timely accessibility. These limitations suggest health social forums might be an alternate source of data for adherence prediction. Indeed, this data is accessible, affordable, timely and available at scale. However, it can also be inaccurate.
Objective:
This paper proposes a medication adherence machine learning model for fibromyalgia therapies that can mitigate the inaccuracy of health social forum data.
Methods:
Transfer learning is a machine learning technique that allows knowledge acquired from one dataset to be transferred to another dataset. In this study, predictive adherence models for the target disease were first developed by using accurate but limited survey data. These models were then used to predict medication adherence from health social forum data. Random forest (RF), an ensemble machine learning technique, was used to develop the predictive models. This transfer learning methodology is demonstrated here by examining data from the Medical Expenditure Panel Survey (MEPS) and the PatientsLikeMe health social forum.
Results:
When the models are carefully designed, less than a 5% difference in accuracy is observed between the MEPS and the PatientsLikeMe medication adherence predictions for fibromyalgia treatments. This design must take into consideration the mapping between the predictors and the outcomes in the two datasets.
Conclusions:
This study exemplifies the potential and the limitations of transfer learning in medication adherence predictive models based on survey data and health social forum data. The proposed approach can make timely medication adherence monitoring cost-effective and widely accessible. Additional investigation is needed to improve the robustness of the approach and to extend its applicability to other therapies and other sources of data.
Citation
Per the author's request the PDF is not available.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.