Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Feb 20, 2024
Open Peer Review Period: Feb 23, 2024 - Apr 19, 2024
Date Accepted: Mar 28, 2025
(closed for review but you can still tweet)
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Can we predict snacking behaviour just from previous instances?
ABSTRACT
Background:
Consuming too much food or drink with high levels of saturated fats, salt or sugar can be harmful for health. Many snack foods fall into this category (HFSS snacks). However, the palatability of these snacks means that people can sometimes struggle to reduce their intake. Machine learning algorithms could help by predicting the likely occurrence of HFSS snacking, so that just-in-time adaptive (JITAI) interventions can be deployed. However, HFSS snacking data has characteristics (such as sparseness and incompleteness), which make snacking prediction a challenging machine learning problem. Previous attempts have employed several potential predictor variables, achieving considerable success. Nevertheless, collecting information along several dimensions requires several potentially burdensome user questionnaires per day, so that this approach may be less acceptable among the general public.
Objective:
Our aim is to consider the capacity of machine learning algorithms to predict HFSS snacking based on minimal data that can be collected in a mostly automated way: day of week, time of day, and location (coarsened as work, home, other).
Methods:
A sample of 111 participants in the UK were asked to record HFSS snack occurrences and location category, over a period of 28 days, leading to a new dataset on HFSS snacks. Data collection was facilitated by a purpose-specific app. Additionally, we use a similar dataset from the Netherlands. For both datasets, we employ machine learning methods (random forests and neural networks).
Results:
We report results concerning the ability of machine learning methods to predict the time of the next HFSS snack. The quality of the prediction depended on both the dataset and temporal resolution employed. In some cases, predictions were accurate to as few as 17 minutes on average.
Conclusions:
We demonstrated that prediction of HFSS snacks on sparse data is possible to reasonable accuracy. We consider the type of prediction problem which may be most suitable for putative interventions in relation to HFSS snacking. While we think it was important to employ standard machine learning algorithms in this work, we also discuss ways to tailor both the machine learning algorithms and the prediction problem to better align with the unique characteristics of the problem.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.