Continual Audio-Visual Sound Separation
Weiguo Pian, Yiyang Nan, Shijian Deng, Shentong Mo, Yunhui Guo, Yapeng Tian

TL;DR
This paper introduces a continual audio-visual sound separation task and proposes a novel method, ContAV-Sep, with a cross-modal similarity distillation constraint to mitigate catastrophic forgetting and improve separation performance in evolving environments.
Contribution
The paper presents a new continual learning framework for audio-visual sound separation, introducing a cross-modal similarity distillation constraint to preserve knowledge across tasks.
Findings
ContAV-Sep effectively mitigates catastrophic forgetting.
The method outperforms existing continual learning baselines.
It maintains cross-modal semantic similarity across tasks.
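The summary above does not give the formula for the cross-modal similarity distillation constraint, so the following is only a minimal sketch of the general idea, not the authors' formulation. It assumes that audio and visual embeddings are available from both a frozen old-task model and the current model, that cross-modal similarity is measured by cosine similarity, and that drift is penalized with a mean squared error. All function names are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def cross_modal_similarity(audio, visual):
    """Matrix of cosine similarities: rows index audio embeddings, columns visual ones."""
    return [[cosine(a, v) for v in visual] for a in audio]


def similarity_distillation_loss(old_audio, old_visual, new_audio, new_visual):
    """Mean squared difference between old-model and new-model similarity matrices.

    Assumption: the old model is frozen and acts as the teacher. A small value
    means the new model has kept the audio-visual similarity structure of the
    old model on the same samples.
    """
    old_sim = cross_modal_similarity(old_audio, old_visual)
    new_sim = cross_modal_similarity(new_audio, new_visual)
    diffs = [
        (o - n) ** 2
        for old_row, new_row in zip(old_sim, new_sim)
        for o, n in zip(old_row, new_row)
    ]
    if not diffs:
        raise ValueError("embedding batches must be non-empty")
    return sum(diffs) / len(diffs)
```

In a training loop, a term like this would typically be added to the separation loss with a weighting factor, so that learning new classes is balanced against preserving the old cross-modal associations.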
Abstract
In this paper, we introduce a novel continual audio-visual sound separation task, aiming to continuously separate sound sources for new classes while preserving performance on previously learned classes, with the aid of visual guidance. This problem is crucial for practical visually guided auditory perception as it can significantly enhance the adaptability and robustness of audio-visual sound separation models, making them more applicable for real-world scenarios where encountering new sound sources is commonplace. The task is inherently challenging as our models must not only effectively utilize information from both modalities in current tasks but also preserve their cross-modal association in old tasks to mitigate catastrophic forgetting during audio-visual continual learning. To address these challenges, we propose a novel approach named ContAV-Sep (\textbf{Cont}inual…
Taxonomy
Topics: Speech and Audio Processing · Image and Signal Denoising Methods
