Word Clouds as Common Voices: LLM-Assisted Visualization of Participant-Weighted Themes in Qualitative Interviews
Joseph T. Colonel, Baihan Lin

TL;DR
This paper presents ThemeClouds, an LLM-assisted visualization tool that creates participant-weighted word clouds highlighting thematic concepts in qualitative interview transcripts, improving interpretability over traditional frequency-based methods.
Contribution
Introduces ThemeClouds, a novel LLM-based tool for generating thematic, participant-weighted word clouds that enhance qualitative data analysis.
Findings
ThemeClouds surfaces more actionable insights than traditional frequency clouds.
The approach outperforms topic-modeling baselines like LDA and BERTopic.
The tool offers customizable prompts and visualization parameters for researcher control.
Abstract
Word clouds are a common way to summarize qualitative interviews, yet traditional frequency-based methods often fail in conversational contexts: they surface filler words, ignore paraphrase, and fragment semantically related ideas. This limits their usefulness in early-stage analysis, when researchers need fast, interpretable overviews of what participant actually said. We introduce ThemeClouds, an open-source visualization tool that uses large language models (LLMs) to generate thematic, participant-weighted word clouds from dialogue transcripts. The system prompts an LLM to identify concept-level themes across a corpus and then counts how many unique participants mention each topic, yielding a visualization grounded in breadth of mention rather than raw term frequency. Researchers can customize prompts and visualization parameters, providing transparency and control. Using interviews…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Qualitative Research Methods and Applications · Data Visualization and Analytics
