ADVOSYNTH: A Synthetic Multi-Advocate Dataset for Speaker Identification in Courtroom Scenarios

Aniket Deroy

arXiv:2601.10315·cs.CL·January 16, 2026

ADVOSYNTH: A Synthetic Multi-Advocate Dataset for Speaker Identification in Courtroom Scenarios

Aniket Deroy

PDF

Open Access

TL;DR

This paper presents ADVOSYNTH-500, a synthetic dataset of courtroom advocate voices designed to evaluate speaker identification systems in structured, high-fidelity speech environments.

Contribution

It introduces a novel synthetic dataset with detailed advocate voice characteristics for benchmarking speaker identification in courtroom scenarios.

Findings

01

Dataset includes 100 synthetic speech files with 10 advocate identities.

02

Simulates courtroom dialogues with five advocate pairs.

03

Provides a new benchmark for speaker identification in synthetic, structured environments.

Abstract

As large-scale speech-to-speech models achieve high fidelity, the distinction between synthetic voices in structured environments becomes a vital area of study. This paper introduces Advosynth-500, a specialized dataset comprising 100 synthetic speech files featuring 10 unique advocate identities. Using the Speech Llama Omni model, we simulate five distinct advocate pairs engaged in courtroom arguments. We define specific vocal characteristics for each advocate and present a speaker identification challenge to evaluate the ability of modern systems to map audio files to their respective synthetic origins. Dataset is available at this link-https: //github.com/naturenurtureelite/ADVOSYNTH-500.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Face recognition and analysis · Speech and Audio Processing