Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Apr 17, 2025
Open Peer Review Period: May 7, 2025 - Jul 2, 2025
Date Accepted: Jul 6, 2025
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Enhancing Oral Health Diagnostics With Hyperspectral Imaging and Computer Vision: Clinical Dataset Study

Roemer P, Ponciano JJ, Kloster K, Siegberg F, Plaß B, Vinayahalingam S, Al-Nawas B, Kaemmerer P, Thomas K, Thiem D

Enhancing Oral Health Diagnostics With Hyperspectral Imaging and Computer Vision: Clinical Dataset Study

JMIR Med Inform 2025;13:e76148

DOI: 10.2196/76148

PMID: 40935589

PMCID: 12425605

Enhancing Oral Health Diagnostics with Hyperspectral Imaging and Computer Vision: A Clinical Dataset Study

  • Paul Roemer; 
  • Jean-Jacques Ponciano; 
  • Katharina Kloster; 
  • Fabia Siegberg; 
  • Bastian Plaß; 
  • Shankeeth Vinayahalingam; 
  • Bilal Al-Nawas; 
  • Peer Kaemmerer; 
  • Klauer Thomas; 
  • Daniel Thiem

ABSTRACT

Background:

Diseases of the oral cavity, including oral squamous cell carcinoma (OSCC) pose major challenges to healthcare worldwide due to its late diagnosis and complicated differentiation of oral tissues. Endoscopic hyperspectral imaging (eHSI) represents a promising approach to the demand for modern, non-invasive tissue diagnostics. This dataset is designed to enhance the performance of deep learning models by providing comprehensive spectral data essential for distinguishing between healthy and pathological oral tissue conditions.

Objective:

To develop and validate a clinical dataset of endoscopic hyperspectral imaging (eHSI) of the oral cavity and to evaluate the performance of deep learning-based semantic segmentation models for automated tissue classification.

Methods:

This clinical study included 226 participants (166 women, 60 men, aged 24-87). eHSI data were collected using an endoscopic hyperspectral sensor system, capturing spectral data in the range of 500-1000 nm. Each participant underwent five standardized intraoral hyperspectral scans of the cheek, palate, tongue, and teeth. RGB and eHSI images were archived in NPY format for Python analysis. Oral structures were annotated using RectLabel Pro©. DeepLabv3 with a ResNet-50 backbone was adapted for eHSI segmentation by modifying the first convolutional layer. The model was trained for 50 epochs on 70% of the dataset, with 30% for evaluation. Performance metrics (Precision, Recall, F1-score) confirmed its efficacy in distinguishing oral tissue types.

Results:

Preliminary analysis revealed that the Coefficient of Variation exceeded 15% for most spectral bands, indicating high variability in spectral signatures. DeepLabv3 (ResNet-101) achieved strong segmentation performance (F1-score: 0.856, Precision: 0.8506, Recall: 0.8634), excelling in key structures—mucosa (F1: 0.915), retractor (F1: 0.940), tooth (F1: 0.902), and palate (F1: 0.900). Moderate results for gingiva (F1: 0.760) and lip (F1: 0.706) suggest potential for further refinement. To assess dataset robustness, additional models were tested, including DeepLabv3 (ResNet-50), FCN (ResNet-50/101), PSPNet (ResNet-50/VGG16), and U-Net (EfficientNet-B0/ResNet-50). DeepLabv3 (ResNet-101) and U-Net (EfficientNet-B0) emerged as top performers, particularly in retractor, mucosa, and tooth segmentation.

Conclusions:

Conducting an in-depth hyperspectral analysis of oral tissue has facilitated the development of robust deep learning algorithms, thereby enhancing diagnostic accuracy and clinical applicability. This study integrates advanced imaging technologies with deep learning, offering significant potential to enhance non-invasive detection, classification, and therefore individualized treatment of oral diseases. Clinical Trial: Deutsche Forschungsgemeinschaft (DFG) - Projectnumber 516210826 https://gepris.dfg.de/gepris/projekt/516210826


 Citation

Please cite as:

Roemer P, Ponciano JJ, Kloster K, Siegberg F, Plaß B, Vinayahalingam S, Al-Nawas B, Kaemmerer P, Thomas K, Thiem D

Enhancing Oral Health Diagnostics With Hyperspectral Imaging and Computer Vision: Clinical Dataset Study

JMIR Med Inform 2025;13:e76148

DOI: 10.2196/76148

PMID: 40935589

PMCID: 12425605

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.