Lost in Speech: Benchmarking, Evaluation, and Parsing of Spoken Code-Switching Beyond Standard UD Assumptions
Nemika Tyagi, Holly Hendrix, Nelvin Licona-Guevara, Justin Mackie, Phanos Kareen, Muhammad Imran, Megan Michelle Smith, Tatiana Gallego Hernande, Chitta Baral, Olga Kellert

TL;DR
This paper addresses the challenges of syntactic parsing in spoken code-switching by introducing a new benchmark, an ambiguity-aware evaluation metric, and a novel parsing framework that significantly improves robustness and interpretability.
Contribution
It presents a linguistically grounded taxonomy, a new benchmark SpokeBench, the FLEX-UD evaluation metric, and the DECAP parsing framework for better spoken CSW analysis beyond standard UD assumptions.
Findings
Existing parsers perform poorly on spoken CSW data.
DECAP improves parsing robustness by up to 52.6%.
FLEX-UD reveals qualitative improvements masked by standard metrics.
Abstract
Spoken code-switching (CSW) challenges syntactic parsing in ways not observed in written text. Disfluencies, repetition, ellipsis, and discourse-driven structure routinely violate standard Universal Dependencies (UD) assumptions, causing parsers and large language models (LLMs) to fail despite strong performance on written data. These failures are compounded by rigid evaluation metrics that conflate genuine structural errors with acceptable variation. In this work, we present a systems-oriented approach to spoken CSW parsing. We introduce a linguistically grounded taxonomy of spoken CSW phenomena and SpokeBench, an expert-annotated gold benchmark designed to test spoken-language structure beyond standard UD assumptions. We further propose FLEX-UD, an ambiguity-aware evaluation metric, which reveals that existing parsing techniques perform poorly on spoken CSW by penalizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Neurobiology of Language and Bilingualism · Topic Modeling
