Large-Scale Goodness Polarity Lexicons for Community Question Answering
Todor Mihaylov, Daniel Belchev, Yasen Kiprov, Ivan Koychev, Preslav, Nakov

TL;DR
This paper introduces large-scale goodness polarity lexicons for community question answering, improving comment ranking by leveraging semi-supervised polarity lexicons inspired by sentiment analysis, leading to state-of-the-art results.
Contribution
It transfers sentiment polarity lexicon techniques to community question answering, creating large-scale goodness lexicons to enhance comment re-ranking.
Findings
Improved MAP by 0.7 points over baseline
Achieved state-of-the-art performance on SemEval-2016 Task 3
Demonstrated effectiveness of polarity lexicons in cQA
Abstract
We transfer a key idea from the field of sentiment analysis to a new domain: community question answering (cQA). The cQA task we are interested in is the following: given a question and a thread of comments, we want to re-rank the comments so that the ones that are good answers to the question would be ranked higher than the bad ones. We notice that good vs. bad comments use specific vocabulary and that one can often predict the goodness/badness of a comment even ignoring the question, based on the comment contents only. This leads us to the idea to build a good/bad polarity lexicon as an analogy to the positive/negative sentiment polarity lexicons, commonly used in sentiment analysis. In particular, we use pointwise mutual information in order to build large-scale goodness polarity lexicons in a semi-supervised manner starting with a small number of initial seeds. The evaluation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
