Accepted for/Published in: JMIR Formative Research
Date Submitted: Dec 16, 2024
Date Accepted: Jun 18, 2025
Deep Learning for Dietary Assessment: A Study on YOLO Models and the Swedish Plate Model
ABSTRACT
Background:
In recent years, the field of computer vision has seen remarkable advancements, particularly with the rise of deep learning techniques, in identifying objects in images. We investigate the application of object detection models, specifically You Only Look Once (YOLO), in the context of food and portion recognition.
Objective:
The objective of the study is to assess the performance of three YOLO models in accurately identifying food components within images and their proportions in terms of the Swedish plate model as per the guidelines of the National Food Agency.
Methods:
The research utilizes a custom dataset comprising 3707 images with 42 different food classes. Various preprocessing and augmentation techniques were applied to enhance dataset quality and model robustness. The performance of the three YOLO models (YOLOv7, YOLOv8, and YOLOv9) are evaluated using precision, recall, mean Average Precision (mAP), and F1 score metrics.
Results:
Our analysis demonstrates that YOLOv8 showed higher performance. While the metrics were satisfactory, the models still had difficulty distinguishing between different food items.
Conclusions:
We determine that further enhancements to both the technology and the training are required for such tools to be included in health monitoring applications.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.