Currently submitted to: JMIR Medical Informatics
Date Submitted: Mar 25, 2026
Open Peer Review Period: Apr 17, 2026 - Jun 12, 2026
(currently open for review)
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Diagnostic Accuracy of a Deep Learning–Assisted System for Middle Ear Diseases Compared with Otolaryngologists: Prospective Observational Study
ABSTRACT
Background:
Accurate diagnosis of middle ear diseases remains challenging due to the specialized nature of otoscopic interpretation and the global shortage of otolaryngologists.
Objective:
We aimed to evaluate the impact of a deep learning-based artificial intelligence (AI) support system on the diagnostic performance of clinicians with varying levels of expertise, specifically focusing on its ability to bridge the gap between trainees and experts.
Methods:
A deep learning model (EfficientNet-B4) was developed using a large-scale dataset of 7,335 endoscopic images. The model was designed for multi-class classification, simultaneously identifying primary categories (Chronic Otitis Media, Otitis Media with Effusion, or Normal) and secondary features (attic retraction, myringitis, otomycosis, and ventilation tube). To assess clinical impact, diagnostic performance was compared among three otology professors (OP), five senior residents (SR), and five junior residents (JR) using a clinical assessment set of 100 images. Participants performed diagnoses both with and without AI-generated predictions, with a time constraint of one minute per image to simulate real-world clinical pressure.
Results:
AI support significantly enhanced diagnostic accuracy across all groups for primary classification, with improvements of 11.0% for OP, 28.8% for SR, and 32.2% for JR (p < 0.001). Notably, the accuracy gap between the most experienced (OP) and least experienced (JR) groups was reduced from 24.3% to 3.1%. For secondary categories, significant improvements were observed in diagnosing attic retraction and myringitis among residents, with some performances even surpassing the standalone AI accuracy.
Conclusions:
Our findings demonstrate that deep learning-based AI support substantially improves the diagnostic accuracy of middle ear diseases, particularly for less experienced clinicians. By narrowing the performance gap between trainees and experts, this system shows high potential as a valuable adjunct for clinical decision-making and medical education in otology. Clinical Trial: This study complies with the Declaration of Helsinki and received research approval from the Institutional Review Board of the Asan Medical Center with a waiver of research consent (IRB no. 2021–0837).
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.