Everybody Likes to Sleep: A Computer-Assisted Comparison of Object Naming Data from 30 Languages

Al\v{z}b\v{e}ta Ku\v{c}erov\'a; Johann-Mattis List

arXiv:2501.08312·cs.CL·August 22, 2025

Everybody Likes to Sleep: A Computer-Assisted Comparison of Object Naming Data from 30 Languages

Al\v{z}b\v{e}ta Ku\v{c}erov\'a, Johann-Mattis List

PDF

Open Access 1 Repo

TL;DR

This study introduces a transparent, multilingual, computer-assisted method to compare object naming datasets across 30 languages, facilitating cross-linguistic research and understanding of cognitive and linguistic patterns.

Contribution

It links diverse object naming datasets to unified concepts, enabling systematic comparison and analysis across multiple languages and linguistic families.

Findings

01

Identified recurring concepts across datasets

02

Compared conceptual spaces with basic vocabulary lists

03

Provided a framework for future cross-linguistic object naming studies

Abstract

Object naming - the act of identifying an object with a word or a phrase - is a fundamental skill in interpersonal communication, relevant to many disciplines, such as psycholinguistics, cognitive linguistics, or language and vision research. Object naming datasets, which consist of concept lists with picture pairings, are used to gain insights into how humans access and select names for objects in their surroundings and to study the cognitive processes involved in converting visual stimuli into semantic concepts. Unfortunately, object naming datasets often lack transparency and have a highly idiosyncratic structure. Our study tries to make current object naming data transparent and comparable by using a multilingual, computer-assisted approach that links individual items of object naming lists to unified concepts. Our current sample links 17 object naming datasets that cover 30…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

calc-project/object-naming-data
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems