BoSS: Beyond-Semantic Speech
Qing Wang, Zehan Li, Hang Lv, Hongjie Chen, Yaodong Song, Jian Kang, Jie Lian, Jie Li, Yongxiang Li, Zhongjiang He, Xuelong Li

TL;DR
This paper introduces a hierarchical framework for speech capabilities and a novel concept called Beyond-Semantic Speech (BoSS), aiming to capture implicit signals, emotions, and context in speech to improve human-machine communication.
Contribution
It formalizes the concept of BoSS, proposes a framework for analyzing beyond-semantic signals, and evaluates current models' ability to interpret these complex speech features.
Findings
Current spoken language models struggle to interpret beyond-semantic signals.
BoSS encompasses emotions, context, and implicit semantics in speech communication.
Advancing BoSS research is essential for richer, context-aware human-machine interactions.
Abstract
Human communication involves more than explicit semantics, with implicit signals and contextual cues playing a critical role in shaping meaning. However, modern speech technologies, such as Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) often fail to capture these beyond-semantic dimensions. To better characterize and benchmark the progression of speech intelligence, we introduce Spoken Interaction System Capability Levels (L1-L5), a hierarchical framework illustrated the evolution of spoken dialogue systems from basic command recognition to human-like social interaction. To support these advanced capabilities, we propose Beyond-Semantic Speech (BoSS), which refers to the set of information in speech communication that encompasses but transcends explicit semantics. It conveys emotions, contexts, and modifies or extends meanings through multidimensional features such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Multi-Agent Systems and Negotiation · Robotics and Automated Systems
