Dataset Construction via Attention for Aspect Term Extraction with Distant Supervision
Athanasios Giannakopoulos, Diego Antognini, Claudiu Musat, Andreea, Hossmann, Michael Baeriswyl

TL;DR
This paper introduces a novel attention-based method to construct a high-quality dataset for Aspect Term Extraction from review corpora, enabling improved distant supervision and performance in aspect-based sentiment analysis.
Contribution
It proposes an attention mechanism to select relevant sentences from review data for creating an ATE dataset, enhancing the quality and effectiveness of distant supervision.
Findings
Attention-based sentence selection improves dataset quality.
Model trained on constructed dataset outperforms baselines.
Selected sentences yield higher ATE accuracy than using all sentences.
Abstract
Aspect Term Extraction (ATE) detects opinionated aspect terms in sentences or text spans, with the end goal of performing aspect-based sentiment analysis. The small amount of available datasets for supervised ATE and the fact that they cover only a few domains raise the need for exploiting other data sources in new and creative ways. Publicly available review corpora contain a plethora of opinionated aspect terms and cover a larger domain spectrum. In this paper, we first propose a method for using such review corpora for creating a new dataset for ATE. Our method relies on an attention mechanism to select sentences that have a high likelihood of containing actual opinionated aspects. We thus improve the quality of the extracted aspects. We then use the constructed dataset to train a model and perform ATE with distant supervision. By evaluating on human annotated datasets, we prove that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
