Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR AI

Date Submitted: Nov 26, 2023
Open Peer Review Period: Nov 26, 2023 - Jan 29, 2024
Date Accepted: Jun 13, 2024
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Use of Deep Neural Networks to Predict Obesity With Short Audio Recordings: Development and Usability Study

Huang J, Guo P, Zhang S, An R, Ji M

Use of Deep Neural Networks to Predict Obesity With Short Audio Recordings: Development and Usability Study

JMIR AI 2024;3:e54885

DOI: 10.2196/54885

PMID: 39052997

PMCID: 11310637

Use of Deep Neural Networks to Predict Obesity with Short Audio Recordings: A Pilot Study

  • Jingyi Huang; 
  • Peiqi Guo; 
  • Sheng Zhang; 
  • Ruopeng An; 
  • Mengmeng Ji

ABSTRACT

Background:

The escalating global prevalence of obesity has necessitated the exploration of novel diagnostic approaches. Recent scientific inquiries have indicated potential alterations in voice characteristics associated with obesity, suggesting the feasibility of using voice as a non-invasive biomarker for obesity detection.

Objective:

This study aims to utilize deep neural networks to predict obesity status through the analysis of short audio recordings, investigating the relationship between vocal characteristics and obesity.

Methods:

A pilot study was conducted with 696 participants, using self-reported body mass index (BMI) to classify individuals into obesity and non-obesity groups. Audio recordings of participants reading a short script were transformed into spectrograms and analyzed using an adapted YOLOv8 model. The model performance was evaluated through accuracy, recall, precision, and F1 scores.

Results:

The adapted YOLOv8 model demonstrated a global accuracy of 0.70 and a macro-F1 score of 0.65. It was more effective in identifying non-obesity (F1 score of 0.77) compared to obesity (F1 score of 0.53). This moderate level of accuracy highlights the potential and challenges in using vocal biomarkers for obesity detection.

Conclusions:

While the study shows promise in the field of voice-based medical diagnostics for obesity, it faces limitations such as reliance on self-reported BMI data and a small, homogenous sample size. These factors, coupled with variability in recording quality, necessitate further research with more robust methodologies and diverse samples to enhance the validity of this novel approach. The findings lay a foundational step for future investigations in using voice as a non-invasive biomarker for obesity detection.


 Citation

Please cite as:

Huang J, Guo P, Zhang S, An R, Ji M

Use of Deep Neural Networks to Predict Obesity With Short Audio Recordings: Development and Usability Study

JMIR AI 2024;3:e54885

DOI: 10.2196/54885

PMID: 39052997

PMCID: 11310637

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.