Speaker Diarization with Overlapping Community Detection Using Graph Attention Networks and Label Propagation Algorithm
Zhaoyang Li, Jie Wang, XiaoXiao Li, Wangjie Li, Longjie Luo, Lin Li, Qingyang Hong

TL;DR
This paper introduces a novel graph attention network and label propagation-based method for speaker diarization that effectively handles overlapping speech segments, significantly reducing error rates in challenging datasets.
Contribution
The paper presents OCDGALP, a new framework combining graph attention networks and label propagation for improved overlapping speaker diarization.
Findings
Achieved 15.94% DER on DIHARD-III without oracle VAD.
Achieved 11.07% DER with oracle VAD.
Outperformed traditional clustering methods in overlapping speech scenarios.
Abstract
In speaker diarization, traditional clustering-based methods remain widely used in real-world applications. However, these methods struggle with the complex distribution of speaker embeddings and overlapping speech segments. To address these limitations, we propose an Overlapping Community Detection method based on Graph Attention networks and the Label Propagation Algorithm (OCDGALP). The proposed framework comprises two key components: (1) a graph attention network that refines speaker embeddings and node connections by aggregating information from neighboring nodes, and (2) a label propagation algorithm that assigns multiple community labels to each node, enabling simultaneous clustering and overlapping community detection. Experimental results show that the proposed method significantly reduces the Diarization Error Rate (DER), achieving a state-of-the-art 15.94% DER on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis
MethodsSoftmax · Attention Is All You Need
