Um, er: How meaning varies between speech and its typed transcript
Harry Collins, Willow Leonard-Clarke, Hannah O'Mahoney

TL;DR
This paper investigates how the meaning of hesitancies in speech differs from their interpretation in transcripts, revealing that context and medium significantly influence perceived uncertainty and meaning.
Contribution
It introduces a novel method for analyzing how communication media alter the perceived meaning of speech elements, pioneering the concept of 'Language Code Analysis.'
Findings
Perceived uncertainty is higher in transcripts than in speech.
Hesitancies indicate different intentions in speech versus transcripts.
Almost no overlap in perceived uncertainty between spoken and transcribed versions.
Abstract
We use an extract from an interview concerning gravitational wave physics to show that the meaning of hesitancies within speech are different when spoken and when read from the corresponding transcript. When used in speech, hesitancies can indicate a pause for thought, when read in a transcript they indicate uncertainty. In a series of experiments the perceived uncertainty of the transcript was shown to be higher than the perceived uncertainty of the spoken version with almost no overlap for any respondent. We propose that finding and the method could be the beginning of a new subject we call 'Language Code Analysis' which would systematically examine how meanings change when the 'same' words are communicated via different media and symbol systems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
