JMIR Preprints #54885: Use of Deep Neural Networks to Predict Obesity with Short Audio Recordings: A Pilot Study

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Use of Deep Neural Networks to Predict Obesity with Short Audio Recordings: A Pilot Study

Jingyi Huang;
Peiqi Guo;
Sheng Zhang;
Ruopeng An;
Mengmeng Ji

ABSTRACT

Background:

The escalating global prevalence of obesity has necessitated the exploration of novel diagnostic approaches. Recent scientific inquiries have indicated potential alterations in voice characteristics associated with obesity, suggesting the feasibility of using voice as a non-invasive biomarker for obesity detection.

Objective:

This study aims to utilize deep neural networks to predict obesity status through the analysis of short audio recordings, investigating the relationship between vocal characteristics and obesity.

Methods:

A pilot study was conducted with 696 participants, using self-reported body mass index (BMI) to classify individuals into obesity and non-obesity groups. Audio recordings of participants reading a short script were transformed into spectrograms and analyzed using an adapted YOLOv8 model. The model performance was evaluated through accuracy, recall, precision, and F1 scores.

Results:

The adapted YOLOv8 model demonstrated a global accuracy of 0.70 and a macro-F1 score of 0.65. It was more effective in identifying non-obesity (F1 score of 0.77) compared to obesity (F1 score of 0.53). This moderate level of accuracy highlights the potential and challenges in using vocal biomarkers for obesity detection.

Conclusions:

While the study shows promise in the field of voice-based medical diagnostics for obesity, it faces limitations such as reliance on self-reported BMI data and a small, homogenous sample size. These factors, coupled with variability in recording quality, necessitate further research with more robust methodologies and diverse samples to enhance the validity of this novel approach. The findings lay a foundational step for future investigations in using voice as a non-invasive biomarker for obesity detection.

Citation

Please cite as:

Huang J, Guo P, Zhang S, An R, Ji M

Use of Deep Neural Networks to Predict Obesity With Short Audio Recordings: Development and Usability Study

JMIR AI 2024;3:e54885

DOI: 10.2196/54885

PMID: 39052997

PMCID: 11310637

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR AI

Date Submitted: Nov 26, 2023

Open Peer Review Period: Nov 26, 2023 - Jan 29, 2024

Date Accepted: Jun 13, 2024

(closed for review but you can still tweet)

Use of Deep Neural Networks to Predict Obesity with Short Audio Recordings: A Pilot Study

ABSTRACT

Citation

Copyright