Cross-lingual Opinions and Emotions Mining in Comparable Documents
Motaz Saad, David Langlois, Kamel Smaili

TL;DR
This paper presents a language-independent method for cross-lingual opinion and emotion analysis in comparable documents, revealing source-dependent sentiment and emotion alignment across English-Arabic news articles.
Contribution
It introduces a novel approach to annotate sentiments and emotions in comparable texts without machine translation, using bilingual lexicons and statistical agreement measures.
Findings
Sentiment and emotion annotations align within the same news source.
Annotations diverge across different news sources.
Method is applicable to other language pairs.
Abstract
Comparable texts are topic-aligned documents in multiple languages that are not direct translations. They are valuable for understanding how a topic is discussed across languages. This research studies differences in sentiments and emotions across English-Arabic comparable documents. First, texts are annotated with sentiment and emotion labels. We apply a cross-lingual method to label documents with opinion classes (subjective/objective), avoiding reliance on machine translation. To annotate with emotions (anger, disgust, fear, joy, sadness, surprise), we manually translate the English WordNet-Affect (WNA) lexicon into Arabic, creating bilingual emotion lexicons used to label the comparable corpora. We then apply a statistical measure to assess the agreement of sentiments and emotions in each source-target document pair. This comparison is especially relevant when the documents…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
