A Fine-Grained Sentiment Dataset for Norwegian
Lilja Øvrelid, Petter Mæhlum, Jeremy Barnes, Erik Velldal

TL;DR
This paper introduces NoReC_fine, a fine-grained Norwegian sentiment dataset annotated for polar expressions, targets, and opinion holders. The texts are professionally authored reviews from multiple news sources and a wide range of domains.
Contribution
It provides the first fine-grained Norwegian sentiment dataset with comprehensive annotation guidelines and initial benchmark results for future research.
Findings
High inter-annotator agreement demonstrated
Dataset covers multiple domains and review types
Preliminary experimental results established as a benchmark
Abstract
We introduce NoReC_fine, a dataset for fine-grained sentiment analysis in Norwegian, annotated with respect to polar expressions, targets and holders of opinion. The underlying texts are taken from a corpus of professionally authored reviews from multiple news-sources and across a wide variety of domains, including literature, games, music, products, movies and more. We here present a detailed description of this annotation effort. We provide an overview of the developed annotation guidelines, illustrated with examples, and present an analysis of inter-annotator agreement. We also report the first experimental results on the dataset, intended as a preliminary benchmark for further experiments.
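To make the three annotated elements concrete, the following is a minimal sketch of how one fine-grained opinion could be represented in code. It is not the dataset's actual file format: the `Opinion` class, its field names, the `polarity` label, the helper functions, and the English example sentence are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Character offsets [start, end) into the sentence (hypothetical convention).
Span = Tuple[int, int]


@dataclass
class Opinion:
    """One annotated opinion; all field names are illustrative assumptions."""
    polar_expression: Span          # the words that carry the sentiment
    polarity: str                   # assumed label, e.g. "positive" / "negative"
    target: Optional[Span] = None   # what the opinion is about, if stated
    holder: Optional[Span] = None   # who holds the opinion, if stated


def find_span(sentence: str, phrase: str) -> Span:
    """Return the offsets of the first occurrence of phrase in sentence."""
    start = sentence.index(phrase)  # raises ValueError if phrase is absent
    return (start, start + len(phrase))


def span_text(sentence: str, span: Optional[Span]) -> Optional[str]:
    """Recover the text covered by a span, or None for a missing element."""
    if span is None:
        return None
    start, end = span
    return sentence[start:end]


# Invented example sentence, not taken from the dataset.
sentence = "I found the soundtrack wonderful"
opinion = Opinion(
    polar_expression=find_span(sentence, "wonderful"),
    polarity="positive",
    target=find_span(sentence, "the soundtrack"),
    holder=find_span(sentence, "I"),
)

print(span_text(sentence, opinion.holder))
print(span_text(sentence, opinion.target))
print(span_text(sentence, opinion.polar_expression))
```

Storing offsets rather than copied strings keeps each element tied to its position in the text, which matters when the same word occurs more than once in a sentence. Target and holder are optional here because an opinion can be expressed without naming either.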
Taxonomy
Topics: Sentiment Analysis and Opinion Mining · Topic Modeling · Advanced Text Analysis Techniques
