Voice-based debate with an AI adversary is associated with increased divergent ideation
Neelam Modi Jain, Dan J. Wang

TL;DR
This study compares voice and text interactions with AI in debates, revealing that voice promotes more verbose, repetitive, and ideationally diverse discourse, challenging assumptions about AI homogenizing cognition.
Contribution
It demonstrates that communication modality influences discourse structure with AI, highlighting the importance of interaction medium in cognitive effects.
Findings
Voice interactions are more verbose and repetitive than text-based exchanges.
Voice users explore a wider range of ideas through recurrent phrasing.
Text-based interactions favor concision but limit conceptual breadth.
Abstract
Concerns that interacting with generative AI homogenizes human cognition are largely based on evidence from text-based interactions, potentially conflating the effects of AI systems with those of written communication. This study examines whether these patterns depend on communication modality rather than on AI itself. Analyzing 957 open-ended debates between university students and a knowledgeable AI adversary, we show that modality corresponds to distinct structural patterns in discourse. Consistent with classic distinctions between orality and literacy, spoken interactions are significantly more verbose and exhibit greater repetition of words and phrases than text-based exchanges. This redundancy, however, is functional: voice users rely on recurrent phrasing to maintain coherence while exploring a wider range of ideas. In contrast, text-based interaction favors concision and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
