Discovering influential text using convolutional neural networks
Megan Ayers, Luke Sanford, Margaret Roberts, Eddie Yang

TL;DR
This paper introduces a convolutional neural network-based method for identifying influential text phrases that affect human evaluations, enhancing interpretability and discovery of causal text features in social science experiments.
Contribution
The paper presents a novel NLP interpretability technique that discovers clusters of predictive text phrases, improving over existing methods in identifying causal text effects.
Findings
The method accurately detects known causal phrases.
It discovers diverse text treatments beyond benchmark methods.
It predicts outcomes as well or better than existing approaches.
Abstract
Experimental methods for estimating the impacts of text on human evaluation have been widely used in the social sciences. However, researchers in experimental settings are usually limited to testing a small number of pre-specified text treatments. While efforts to mine unstructured texts for features that causally affect outcomes have been ongoing in recent years, these models have primarily focused on the topics or specific words of text, which may not always be the mechanism of the effect. We connect these efforts with NLP interpretability techniques and present a method for flexibly discovering clusters of similar text phrases that are predictive of human reactions to texts using convolutional neural networks. When used in an experimental setting, this method can identify text treatments and their effects under certain assumptions. We apply the method to two datasets. The first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Text Analysis Techniques · Topic Modeling
