Double Mixture: Towards Continual Event Detection from Speech

Jingqi Kang; Tongtong Wu; Jinming Zhao; Guitao Wang; Yinwei Wei; Hao; Yang; Guilin Qi; Yuan-Fang Li; Gholamreza Haffari

arXiv:2404.13289·cs.CL·October 29, 2024

Double Mixture: Towards Continual Event Detection from Speech

Jingqi Kang, Tongtong Wu, Jinming Zhao, Guitao Wang, Yinwei Wei, Hao, Yang, Guilin Qi, Yuan-Fang Li, Gholamreza Haffari

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new task and benchmark datasets for continual speech event detection, proposing the 'Double Mixture' method to improve adaptability and prevent forgetting in detecting semantic and acoustic events from speech.

Contribution

It presents the first continual speech event detection task, along with benchmark datasets and a novel 'Double Mixture' method that combines speech expertise with memory mechanisms.

Findings

01

Achieves lowest forgetting rates among compared methods

02

Demonstrates high generalization across different sequences

03

Outperforms existing methods in continual learning scenarios

Abstract

Speech event detection is crucial for multimedia retrieval, involving the tagging of both semantic and acoustic events. Traditional ASR systems often overlook the interplay between these events, focusing solely on content, even though the interpretation of dialogue can vary with environmental context. This paper tackles two primary challenges in speech event detection: the continual integration of new events without forgetting previous ones, and the disentanglement of semantic from acoustic events. We introduce a new task, continual event detection from speech, for which we also provide two benchmark datasets. To address the challenges of catastrophic forgetting and effective disentanglement, we propose a novel method, 'Double Mixture.' This method merges speech expertise with robust memory mechanisms to enhance adaptability and prevent forgetting. Our comprehensive experiments show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jodie-kang/doublemixture
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques · Sentiment Analysis and Opinion Mining · Speech Recognition and Synthesis