PAP900: A dataset of semantic relationships between affective words in Portuguese
André Fernandes dos Santos, José Paulo Leal, Rui Alexandre Alves, Teresa Jacques

TL;DR
PAP900 is a Portuguese dataset of 900 affective word pairs, each annotated for semantic similarity and relatedness by at least 30 raters.
Contribution
PAP900 is the first Portuguese dataset focusing on affective words with detailed annotations and annotator sociodemographics.
Findings
The dataset includes semantic similarity and relatedness ratings for 900 affective word pairs.
Annotator sociodemographics are included to study their influence on semantic perception.
The dataset is available in multiple formats for diverse research needs.
Abstract
The PAP900 dataset centers on the semantic relationships between affective words in Portuguese. It contains 900 word pairs, each annotated by at least 30 human raters for both semantic similarity and semantic relatedness. In addition to the semantic ratings, the dataset includes the word categorization used to build the word pairs and detailed sociodemographic information about the annotators, enabling analysis of how personal factors influence the perception of semantic relationships. This article also describes the dataset construction process in detail, from word selection to agreement metrics. Data was collected from Portuguese university psychology students, who completed two rounds of questionnaires. In the first round, annotators were asked to rate word pairs on either semantic similarity or relatedness. The second round switched the relation type for most…
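As a rough illustration of how per-pair ratings of this kind might be summarized, the sketch below groups individual annotations by word pair and relation type and reports the mean, spread, and number of raters. The record layout, the example words, and the 0 to 4 rating scale are assumptions made for illustration; they are not the actual PAP900 schema or data.

```python
from collections import defaultdict
from statistics import mean, stdev

# Hypothetical long-format records: (word1, word2, relation, annotator_id, rating).
# The words, the 0-4 scale, and the field layout are illustrative assumptions,
# not the actual PAP900 schema or values.
RATINGS = [
    ("alegria", "felicidade", "similarity", "a1", 4),
    ("alegria", "felicidade", "similarity", "a2", 3),
    ("alegria", "felicidade", "similarity", "a3", 4),
    ("alegria", "felicidade", "relatedness", "a1", 4),
    ("alegria", "felicidade", "relatedness", "a2", 4),
    ("alegria", "felicidade", "relatedness", "a3", 4),
    ("medo", "festa", "similarity", "a1", 0),
    ("medo", "festa", "similarity", "a2", 1),
    ("medo", "festa", "similarity", "a3", 0),
    ("medo", "festa", "relatedness", "a1", 1),
    ("medo", "festa", "relatedness", "a2", 2),
    ("medo", "festa", "relatedness", "a3", 0),
]


def aggregate(records):
    """Group ratings by (word1, word2, relation) and return mean, spread, and count."""
    grouped = defaultdict(list)
    for w1, w2, relation, _annotator, rating in records:
        grouped[(w1, w2, relation)].append(rating)
    return {
        key: {
            "mean": mean(values),
            # Sample standard deviation needs at least two ratings.
            "stdev": stdev(values) if len(values) > 1 else 0.0,
            "n": len(values),
        }
        for key, values in grouped.items()
    }


summary = aggregate(RATINGS)
for (w1, w2, relation), stats in sorted(summary.items()):
    print(f"{w1}-{w2} [{relation}]: mean={stats['mean']:.2f} "
          f"sd={stats['stdev']:.2f} n={stats['n']}")
```

Keeping similarity and relatedness as separate keys mirrors the abstract's point that each pair is rated on both relations, so the two means can be compared directly for the same pair.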
Taxonomy
Topics: Sentiment Analysis and Opinion Mining · Topic Modeling · Neurobiology of Language and Bilingualism
