Ear-Keeper: A Cross-Platform AI System for Rapid and Accurate Ear Disease Diagnosis
Feiyan Lu, Yubiao Yue, Zhenzhang Li, Meiping Zhang, Wen Luo, Fan Zhang, Tong Liu, Jingyong Shi, Guang Wang, Xinyu Zeng

TL;DR
Ear-Keeper introduces a fast, lightweight AI system with a large, diverse dataset and novel architecture for real-time ear disease diagnosis across platforms, improving accuracy and clinical trust.
Contribution
This work presents a new large-scale otoendoscopy dataset, a novel lightweight deep learning model, and a cross-platform system for rapid, accurate ear disease detection.
Findings
Achieved over 95% accuracy on internal data
Model size of only 2.94 MB enabling deployment on various devices
Real-time processing at 80 frames per second on CPU
Abstract
Early and accurate detection systems for ear diseases, powered by deep learning, are essential for preventing hearing impairment and improving population health. However, the limited diversity of existing otoendoscopy datasets and the poor balance between diagnostic accuracy, computational efficiency, and model size have hindered the translation of artificial intelligence (AI) algorithms into healthcare applications. In this study, we constructed a large-scale, multi-center otoendoscopy dataset covering eight common ear diseases and healthy cases. Building upon this resource, we developed Best-EarNet, an ultrafast and lightweight deep learning architecture integrating a novel Local-Global Spatial Feature Fusion Module with a multi-scale supervision strategy, enabling real-time and accurate classification of ear conditions. Leveraging transfer learning, Best-EarNet, with a model size of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEar Surgery and Otitis Media · Nasal Surgery and Airway Studies · Reconstructive Facial Surgery Techniques
MethodsFocus · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
