Stay on Topic, Please: Aligning User Comments to the Content of a News Article
Jumanah Alshehri, Marija Stanojevic, Eduard Dragut, Zoran Obradovic

TL;DR
This paper introduces a BERT-based classification method to determine the relevance of user comments to news articles, significantly improving accuracy and addressing the challenge of aligning informal comments with formal news content.
Contribution
The study presents a novel BERTAC model with an ordinal loss function for better comment-article relevance classification, outperforming existing baselines.
Findings
Up to 36% accuracy improvement over baselines
The proposed loss influences the learning process positively
Moderate human agreement indicates the task's difficulty
Abstract
Social scientists have shown that up to 50% if the content posted to a news article have no relation to its journalistic content. In this study we propose a classification algorithm to categorize user comments posted to a new article base don their alignment to its content. The alignment seek to match user comments to an article based on similarity off content, entities in discussion, and topic. We proposed a BERTAC, BAERT-based approach that learn jointly article-comment embeddings and infers the relevance class of comments. We introduce an ordinal classification loss that penalizes the difference between the predicted and true label. We conduct a thorough study to show influence of the proposed loss on the learning process. The results on five representative news outlets show that our approach can learn the comment class with up to 36% average accuracy improvement compering to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
