Everybody Likes to Sleep: A Computer-Assisted Comparison of Object Naming Data from 30 Languages
Al\v{z}b\v{e}ta Ku\v{c}erov\'a, Johann-Mattis List

TL;DR
This study introduces a transparent, multilingual, computer-assisted method to compare object naming datasets across 30 languages, facilitating cross-linguistic research and understanding of cognitive and linguistic patterns.
Contribution
It links diverse object naming datasets to unified concepts, enabling systematic comparison and analysis across multiple languages and linguistic families.
Findings
Identified recurring concepts across datasets
Compared conceptual spaces with basic vocabulary lists
Provided a framework for future cross-linguistic object naming studies
Abstract
Object naming - the act of identifying an object with a word or a phrase - is a fundamental skill in interpersonal communication, relevant to many disciplines, such as psycholinguistics, cognitive linguistics, or language and vision research. Object naming datasets, which consist of concept lists with picture pairings, are used to gain insights into how humans access and select names for objects in their surroundings and to study the cognitive processes involved in converting visual stimuli into semantic concepts. Unfortunately, object naming datasets often lack transparency and have a highly idiosyncratic structure. Our study tries to make current object naming data transparent and comparable by using a multilingual, computer-assisted approach that links individual items of object naming lists to unified concepts. Our current sample links 17 object naming datasets that cover 30…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
