Still Between Us? Evaluating and Improving Voice Assistant Robustness to Third-Party Interruptions

Dongwook Lee; Eunwoo Song; Che Hyun Lee; Heeseung Kim; Sungroh Yoon

arXiv:2604.17358·cs.CL·April 21, 2026

Still Between Us? Evaluating and Improving Voice Assistant Robustness to Third-Party Interruptions

Dongwook Lee, Eunwoo Song, Che Hyun Lee, Heeseung Kim, Sungroh Yoon

PDF

1 Repo

TL;DR

This paper introduces TPI-Train and TPI-Bench, new datasets and evaluation tools to improve voice assistants' ability to handle third-party interruptions by emphasizing acoustic cues over semantic context.

Contribution

It presents novel datasets and evaluation frameworks that enhance speaker discrimination and interruption handling in spoken language models.

Findings

01

Dataset design reduces semantic shortcut learning.

02

Framework effectively measures interruption handling and speaker discrimination.

03

Code is publicly available at https://tpi-va.github.io

Abstract

While recent Spoken Language Models (SLMs) have been actively deployed in real-world scenarios, they lack the capability to discern Third-Party Interruptions (TPI) from the primary user's ongoing flow, leaving them vulnerable to contextual failures. To bridge this gap, we introduce TPI-Train, a dataset of 88K instances designed with speaker-aware hard negatives to enforce acoustic cue prioritization for interruption handling, and TPI-Bench, a comprehensive evaluation framework designed to rigorously measure the interruption-handling strategy and precise speaker discrimination in deceptive contexts. Experiments demonstrate that our dataset design mitigates semantic shortcut learning-a critical pitfall where models exploit semantic context while neglecting acoustic signals essential for discerning speaker changes. We believe our work establishes a foundational resource for overcoming…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://tpi-va.github.io
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.