Do AI Voices Learn Social Nuances? A Case of Politeness and Speech Rate
Eyal Rabin, Zohar Elyoseph, Rotem Israel-Fishelson, Adi Dali, Ravit Nussinson

TL;DR
This study shows that advanced text-to-speech AI systems can learn and reproduce social nuances like politeness through speech rate, indicating their potential to understand and embody human social norms.
Contribution
It provides empirical evidence that state-of-the-art AI voices can implicitly learn social cues such as politeness by adjusting speech rate, a novel insight into AI social competence.
Findings
Polite prompts lead to slower speech in AI voices.
Significant speech rate differences observed across platforms.
AI can internalize and reproduce social communication cues.
Abstract
Voice-based artificial intelligence is increasingly expected to adhere to human social conventions, but can it learn implicit cues that are not explicitly programmed? This study investigates whether state-of-the-art text-to-speech systems have internalized the human tendency to reduce speech rate to convey politeness - a non-obvious prosodic marker. We prompted 22 synthetic voices from two leading AI platforms (AI Studio and OpenAI) to read a fixed script under both "polite and formal" and "casual and informal" conditions and measured the resulting speech duration. Across both AI platforms, the polite prompt produced slower speech than the casual prompt with very large effect sizes, an effect that was statistically significant for all of AI Studio's voices and for a large majority of OpenAI's voices. These results demonstrate that AI can implicitly learn and replicate psychological…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in Service Interactions · Neurobiology of Language and Bilingualism · Action Observation and Synchronization
