Double Mixture: Towards Continual Event Detection from Speech
Jingqi Kang, Tongtong Wu, Jinming Zhao, Guitao Wang, Yinwei Wei, Hao, Yang, Guilin Qi, Yuan-Fang Li, Gholamreza Haffari

TL;DR
This paper introduces a new task and benchmark datasets for continual speech event detection, proposing the 'Double Mixture' method to improve adaptability and prevent forgetting in detecting semantic and acoustic events from speech.
Contribution
It presents the first continual speech event detection task, along with benchmark datasets and a novel 'Double Mixture' method that combines speech expertise with memory mechanisms.
Findings
Achieves lowest forgetting rates among compared methods
Demonstrates high generalization across different sequences
Outperforms existing methods in continual learning scenarios
Abstract
Speech event detection is crucial for multimedia retrieval, involving the tagging of both semantic and acoustic events. Traditional ASR systems often overlook the interplay between these events, focusing solely on content, even though the interpretation of dialogue can vary with environmental context. This paper tackles two primary challenges in speech event detection: the continual integration of new events without forgetting previous ones, and the disentanglement of semantic from acoustic events. We introduce a new task, continual event detection from speech, for which we also provide two benchmark datasets. To address the challenges of catastrophic forgetting and effective disentanglement, we propose a novel method, 'Double Mixture.' This method merges speech expertise with robust memory mechanisms to enhance adaptability and prevent forgetting. Our comprehensive experiments show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Sentiment Analysis and Opinion Mining · Speech Recognition and Synthesis
