ConcateNet: Dialogue Separation Using Local And Global Feature   Concatenation

Mhd Modar Halimeh; Matteo Torcoli; Emanu\"el Habets

arXiv:2408.08729·eess.AS·August 19, 2024

ConcateNet: Dialogue Separation Using Local And Global Feature Concatenation

Mhd Modar Halimeh, Matteo Torcoli, Emanu\"el Habets

PDF

Open Access

TL;DR

ConcateNet introduces a novel architecture for dialogue separation that effectively combines local and global features, demonstrating superior generalization to out-of-domain signals compared to existing noise-reduction methods.

Contribution

The paper presents ConcateNet, a new approach that enhances dialogue separation by processing local and global features for improved out-of-domain generalization.

Findings

01

Competitive performance on in-domain datasets.

02

Superior out-of-domain generalization compared to state-of-the-art methods.

03

Effective processing of local and global features for dialogue separation.

Abstract

Dialogue separation involves isolating a dialogue signal from a mixture, such as a movie or a TV program. This can be a necessary step to enable dialogue enhancement for broadcast-related applications. In this paper, ConcateNet for dialogue separation is proposed, which is based on a novel approach for processing local and global features aimed at better generalization for out-of-domain signals. ConcateNet is trained using a noise reduction-focused, publicly available dataset and evaluated using three datasets: two noise reduction-focused datasets (in-domain), which show competitive performance for ConcateNet, and a broadcast-focused dataset (out-of-domain), which verifies the better generalization performance for the proposed architecture compared to considered state-of-the-art noise-reduction methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Natural Language Processing Techniques · Topic Modeling