Accepted for/Published in: JMIR AI
Date Submitted: Nov 26, 2023
Open Peer Review Period: Nov 26, 2023 - Jan 29, 2024
Date Accepted: Jun 13, 2024
(closed for review but you can still tweet)
Use of Deep Neural Networks to Predict Obesity with Short Audio Recordings: A Pilot Study
ABSTRACT
Background:
The escalating global prevalence of obesity has necessitated the exploration of novel diagnostic approaches. Recent scientific inquiries have indicated potential alterations in voice characteristics associated with obesity, suggesting the feasibility of using voice as a non-invasive biomarker for obesity detection.
Objective:
This study aims to utilize deep neural networks to predict obesity status through the analysis of short audio recordings, investigating the relationship between vocal characteristics and obesity.
Methods:
A pilot study was conducted with 696 participants, using self-reported body mass index (BMI) to classify individuals into obesity and non-obesity groups. Audio recordings of participants reading a short script were transformed into spectrograms and analyzed using an adapted YOLOv8 model. The model performance was evaluated through accuracy, recall, precision, and F1 scores.
Results:
The adapted YOLOv8 model demonstrated a global accuracy of 0.70 and a macro-F1 score of 0.65. It was more effective in identifying non-obesity (F1 score of 0.77) compared to obesity (F1 score of 0.53). This moderate level of accuracy highlights the potential and challenges in using vocal biomarkers for obesity detection.
Conclusions:
While the study shows promise in the field of voice-based medical diagnostics for obesity, it faces limitations such as reliance on self-reported BMI data and a small, homogenous sample size. These factors, coupled with variability in recording quality, necessitate further research with more robust methodologies and diverse samples to enhance the validity of this novel approach. The findings lay a foundational step for future investigations in using voice as a non-invasive biomarker for obesity detection.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.