Classification of Consumer Belief Statements From Social Media
Gerhard Johann Hagerer, Wenbin Le, Hannah Danner, Georg Groh

TL;DR
This paper investigates how expert labels and automated clustering techniques can be used for classifying consumer belief statements from social media, especially when dealing with complex, fine-grained classes and limited data.
Contribution
It compares the effectiveness of expert annotations, hierarchical clustering, and unsupervised clustering for classifying social media content in opinion mining.
Findings
Unsupervised clustering performs comparably to expert labels.
Automated class abstraction can enhance large-scale opinion mining.
Hierarchical clustering provides a viable alternative to expert annotations.
Abstract
Social media offer plenty of information to perform market research in order to meet the requirements of customers. One way how this research is conducted is that a domain expert gathers and categorizes user-generated content into a complex and fine-grained class structure. In many of such cases, little data meets complex annotations. It is not yet fully understood how this can be leveraged successfully for classification. We examine the classification accuracy of expert labels when used with a) many fine-grained classes and b) few abstract classes. For scenario b) we compare abstract class labels given by the domain expert as baseline and by automatic hierarchical clustering. We compare this to another baseline where the entire class structure is given by a completely unsupervised clustering approach. By doing so, this work can serve as an example of how complex expert annotations are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques · Topic Modeling
