User-Guided Aspect Classification for Domain-Specific Texts
Peiran Li, Fang Guo, Jingbo Shang

TL;DR
This paper introduces ARYA, a novel framework for aspect classification in domain-specific texts that uses minimal user input and handles noisy miscellaneous aspects through iterative learning and seed updating.
Contribution
The paper presents a new framework, ARYA, that improves aspect classification by jointly modeling pre-defined and miscellaneous aspects with minimal supervision.
Findings
ARYA outperforms existing methods in two domain-specific datasets.
Proper modeling of the miscellaneous aspect significantly enhances classification accuracy.
Iterative seed updating effectively filters noisy seed words.
Abstract
Aspect classification, identifying aspects of text segments, facilitates numerous applications, such as sentiment analysis and review summarization. To alleviate the human effort on annotating massive texts, in this paper, we study the problem of classifying aspects based on only a few user-provided seed words for pre-defined aspects. The major challenge lies in how to handle the noisy misc aspect, which is designed for texts without any pre-defined aspects. Even domain experts have difficulties to nominate seed words for the misc aspect, making existing seed-driven text classification methods not applicable. We propose a novel framework, ARYA, which enables mutual enhancements between pre-defined aspects and the misc aspect via iterative classifier training and seed updating. Specifically, it trains a classifier for pre-defined aspects and then leverages it to induce the supervision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques · Topic Modeling
