WMDecompose: A Framework for Leveraging the Interpretable Properties of Word Mover's Distance in Sociocultural Analysis
Mikael Brunila, Jack LaViolette

TL;DR
WMDecompose enhances the interpretability of Word Mover's Distance by decomposing document distances into word-level components and clustering them to reveal thematic insights, demonstrated on social media data.
Contribution
It introduces WMDecompose, a novel framework and Python library that decomposes WMD into interpretable word-level distances and clusters them for sociocultural analysis.
Findings
Decomposes document distances into word-level components.
Clusters words to identify thematic elements.
Applied to social media data on conspiracy theories.
Abstract
Despite the increasing popularity of NLP in the humanities and social sciences, advances in model performance and complexity have been accompanied by concerns about interpretability and explanatory power for sociocultural analysis. One popular model that balances complexity and legibility is Word Mover's Distance (WMD). Ostensibly adapted for its interpretability, WMD has nonetheless been used and further developed in ways which frequently discard its most interpretable aspect: namely, the word-level distances required for translating a set of words into another set of words. To address this apparent gap, we introduce WMDecompose: a model and Python library that 1) decomposes document-level distances into their constituent word-level distances, and 2) subsequently clusters words to induce thematic elements, such that useful lexical information is retained and summarized for analysis. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Misinformation and Its Impacts
Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide)
