TL;DR
This paper introduces a bidirectional conditional encoding method for stance detection that effectively handles unseen targets and improves performance on Twitter data, achieving state-of-the-art results with weak supervision.
Contribution
It proposes a bidirectional conditional LSTM encoding approach that outperforms previous methods, especially in scenarios with unseen targets and limited training data.
Findings
Outperforms previous models on SemEval 2016 dataset
Achieves second-best performance without target-specific training data
Reaches state-of-the-art results with weak supervision
Abstract
Stance detection is the task of classifying the attitude expressed in a text towards a target such as Hillary Clinton to be "positive", negative" or "neutral". Previous work has assumed that either the target is mentioned in the text or that training data for every target is given. This paper considers the more challenging version of this task, where targets are not always mentioned and no training data is available for the test targets. We experiment with conditional LSTM encoding, which builds a representation of the tweet that is dependent on the target, and demonstrate that it outperforms encoding the tweet and the target independently. Performance is improved further when the conditional model is augmented with bidirectional encoding. We evaluate our approach on the SemEval 2016 Task 6 Twitter Stance Detection corpus achieving performance second best only to a system trained on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
