A Generalist Audio Foundation Model for Comprehensive Body Sound Auscultation
Pingjie Wang, Liudan Zhao, Zihan Zhao, Miao He, Xin Sun, Ya Zhang, Kun, Sun, Yanfeng Wang, Yu Wang

TL;DR
AuscultaBase is a new AI framework that uses advanced learning techniques and large-scale data to improve body sound analysis for better diagnosis, especially in resource-limited settings.
Contribution
It introduces AuscultaBase, a novel AI model leveraging self-supervised and contrastive learning with multi-source data for comprehensive body sound analysis.
Findings
Outperforms existing methods on AuscultaBench benchmark
Enhances abnormality detection accuracy
Improves disease classification and activity recognition
Abstract
Accurate and efficient auscultation-based diagnostics are vital for early disease detection, especially in resource-limited settings where specialized clinical expertise is scarce. Traditional auscultation, which heavily depends on clinician experience, suffers from significant inter-observer variability, while existing AI models often falter due to the limitations of non-representative training data. In this study, we introduce AuscultaBase, a novel AI-driven diagnostic framework that harnesses self-supervised and contrastive learning techniques alongside large-scale, multi-source data integration to advance body sound analysis. By generating robust feature representations, AuscultaBase markedly enhances performance in abnormality detection, disease classification, and activity recognition tasks. Comprehensive evaluations on our newly established benchmark, AuscultaBench, demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhonocardiography and Auscultation Techniques
