Ear-Keeper: A Cross-Platform AI System for Rapid and Accurate Ear Disease Diagnosis

Feiyan Lu; Yubiao Yue; Zhenzhang Li; Meiping Zhang; Wen Luo; Fan Zhang; Tong Liu; Jingyong Shi; Guang Wang; Xinyu Zeng

arXiv:2308.10610·cs.CV·August 14, 2025·1 cites

Ear-Keeper: A Cross-Platform AI System for Rapid and Accurate Ear Disease Diagnosis

Feiyan Lu, Yubiao Yue, Zhenzhang Li, Meiping Zhang, Wen Luo, Fan Zhang, Tong Liu, Jingyong Shi, Guang Wang, Xinyu Zeng

PDF

Open Access

TL;DR

Ear-Keeper introduces a fast, lightweight AI system with a large, diverse dataset and novel architecture for real-time ear disease diagnosis across platforms, improving accuracy and clinical trust.

Contribution

This work presents a new large-scale otoendoscopy dataset, a novel lightweight deep learning model, and a cross-platform system for rapid, accurate ear disease detection.

Findings

01

Achieved over 95% accuracy on internal data

02

Model size of only 2.94 MB enabling deployment on various devices

03

Real-time processing at 80 frames per second on CPU

Abstract

Early and accurate detection systems for ear diseases, powered by deep learning, are essential for preventing hearing impairment and improving population health. However, the limited diversity of existing otoendoscopy datasets and the poor balance between diagnostic accuracy, computational efficiency, and model size have hindered the translation of artificial intelligence (AI) algorithms into healthcare applications. In this study, we constructed a large-scale, multi-center otoendoscopy dataset covering eight common ear diseases and healthy cases. Building upon this resource, we developed Best-EarNet, an ultrafast and lightweight deep learning architecture integrating a novel Local-Global Spatial Feature Fusion Module with a multi-scale supervision strategy, enabling real-time and accurate classification of ear conditions. Leveraging transfer learning, Best-EarNet, with a model size of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEar Surgery and Otitis Media · Nasal Surgery and Airway Studies · Reconstructive Facial Surgery Techniques

MethodsFocus · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings