ConversationAlign: Open-source software for analyzing patterns of lexical use and alignment in conversation transcripts
Benjamin Sacks, Virginia Ulichney, Anna Duncan, Chelsea Helion, Sarah M. Weinstein, Tania Giovannetti, Gus Cooney, Jamie Reilly

TL;DR
This paper introduces ConversationAlign, an open-source tool for analyzing language patterns and alignment in conversation transcripts.
Contribution
The novel contribution is an R package that computes alignment indices across multiple lexical and affective dimensions in naturalistic conversations.
Findings
ConversationAlign transforms raw language data into time series for analysis of lexical alignment.
The tool identifies both local and global alignment patterns between conversation partners.
A use case with Terry Gross interviews demonstrates the package's utility in analyzing long-term conversational trends.
Abstract
Much of our scientific understanding of language processing has been informed by controlled experiments divorced from the real-world demands of naturalistic communication. Conversation requires synchronization of rate, amplitude, lexical complexity, affective coloring, shared reference, and countless other verbal and nonverbal dimensions. Conversation is not merely a vector for information transfer but also serves as a mechanism for establishing or maintaining social relationships. This process of language calibration between interlocutors is known as linguistic alignment. We developed an open-source R package, ConversationAlign, capable of computing novel indices of linguistic alignment and main effects of language use between interlocutors by evaluating word choice across numerous semantic, affective, and lexical dimensions (e.g., valence, concreteness, frequency, word length). We…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Linguistic Variation and Morphology · Neurobiology of Language and Bilingualism
