Small Stickers, Big Meanings: A Multilingual Sticker Semantic Understanding Dataset with a Gamified Approach

Heng Er Metilda Chee; Jiayin Wang; Zhiqiang Guo; Weizhi Ma; Min Zhang

arXiv:2506.01668 · cs.MM · September 29, 2025

TL;DR

This paper introduces StickerQueries, a multilingual sticker query dataset, and Sticktionary, a gamified annotation framework, to improve sticker retrieval and understanding. Together they address the subjectivity and human effort that make high-quality sticker query datasets hard to build.

Contribution

The paper contributes Sticktionary, a gamified annotation framework; StickerQueries, a multilingual sticker query dataset with 1,115 English and 615 Chinese queries; and fine-tuned models that improve sticker retrieval and semantic understanding.

Findings

01 Enhanced query generation quality

02 Improved retrieval accuracy

03 Effective semantic understanding

Abstract

Stickers, though small, are a highly condensed form of visual expression, ubiquitous across messaging platforms and embraced by diverse cultures, genders, and age groups. Despite their popularity, sticker retrieval remains an underexplored task due to the significant human effort and subjectivity involved in constructing high-quality sticker query datasets. Although large language models (LLMs) excel at general NLP tasks, they falter when confronted with the nuanced, intangible, and highly specific nature of sticker query generation. To address this challenge, we propose a threefold solution. First, we introduce Sticktionary, a gamified annotation framework designed to gather diverse, high-quality, and contextually resonant sticker queries. Second, we present StickerQueries, a multilingual sticker query dataset containing 1,115 English and 615 Chinese queries, annotated by over 60…
