Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: May 11, 2025
Date Accepted: Sep 26, 2025

The final, peer-reviewed published version of this preprint can be found here:

Cough Audio Recognition for Early Detection of Respiratory Diseases: Algorithm Development and Validation Study

Sun W, Jiahao Z, Yin N, Yin N, Yin N, Sun J, Fang W, Miao Z, Yang SY

Cough Audio Recognition for Early Detection of Respiratory Diseases: Algorithm Development and Validation Study

JMIR Med Inform 2026;14:e77295

DOI: 10.2196/77295

PMID: 42096594

PMCID: 13152037

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Application of Cough Audio Recognition based on Machine Learning in Early Diagnosis of Diseases;

  • Wensheng Sun; 
  • Zou Jiahao; 
  • Na Yin; 
  • Na Yin; 
  • Na Yin; 
  • Jimin Sun; 
  • Wenying Fang; 
  • Ziping Miao; 
  • Shigui Yang Yang

ABSTRACT

Background:

Coughing is a common clinical symptom and a protective respiratory reflex closely associated with various respiratory system diseases. The acoustic characteristics of cough sounds are influenced by underlying pathological factors, with distinct acoustic signatures corresponding to different etiologies. Through rigorously analysis of these sounds, rapid identification and preliminary diagnosis of related conditions may be achieved. diseases. This approach holds great potential for broad application in mobile health (mHealth) and ubiquitous health (uHealth) platforms.

Objective:

Exploring the application of acoustic analysis of cough sounds in the diagnosis of respiratory diseases to enhance the diagnostic efficiency of healthcare professionals.

Methods:

In this study, we conducted extensive data collection, including voluntary cough audio recordings from patients diagnosed with respiratory diseases (such as chronic obstructive pulmonary disease, lung cancer, COVID-19, and pneumonia), as well as from healthy participants. A total of 2,610 audio samples were collected. We incorporated a Channel Attention Mechanism (CAM) into the final convolutional block of each residual block in the ResNet18 neural network, thereby constructing the CAM-ResNet18 neural network model. The recorded cough audio samples were converted into spectrograms to form the input dataset for model training. The CAM-ResNet18 model was trained on the training set of this dataset, with iterative parameter adjustments until convergence was achieved. Finally, spectrograms from the test set were fed into the pre-trained model for accurate classification of the cough-related conditions.

Results:

Experiments results on the collected audio dataset demonnstrate that the proposed CAM-ResNet18 model achieves an accuracy of 83.9% and an average F1 score of 82.52% in classifying these five types of cough sounds. In comparison, the traditional ResNet18 model achieves an accuracy of 78.16% and an average F1 score of 78.29%, indicating a clear performance improvement with the integration of the Channel Attention Mechanism.

Conclusions:

The experimental results validate the effectiveness of the proposed method, highlighting its significant potential for application in clinical diagnosis.


 Citation

Please cite as:

Sun W, Jiahao Z, Yin N, Yin N, Yin N, Sun J, Fang W, Miao Z, Yang SY

Cough Audio Recognition for Early Detection of Respiratory Diseases: Algorithm Development and Validation Study

JMIR Med Inform 2026;14:e77295

DOI: 10.2196/77295

PMID: 42096594

PMCID: 13152037

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.