CNSL-bench: Benchmarking the Sign Language Understanding Capabilities of MLLMs on Chinese National Sign Language

Rui Zhao; Xuewen Zhong; Xiaoyun Zheng; Jinsong Su; Yidong Chen

arXiv:2604.22367·cs.CL·April 27, 2026

CNSL-bench: Benchmarking the Sign Language Understanding Capabilities of MLLMs on Chinese National Sign Language

Rui Zhao, Xuewen Zhong, Xiaoyun Zheng, Jinsong Su, Yidong Chen

PDF

TL;DR

This paper introduces CNSL-bench, a comprehensive benchmark for evaluating multimodal large language models' understanding of Chinese National Sign Language, highlighting current models' significant performance gaps compared to humans.

Contribution

The paper presents CNSL-bench, the first standardized, multimodal Chinese sign language benchmark grounded in official dictionaries, enabling detailed evaluation of MLLMs' sign language understanding capabilities.

Findings

01

Current MLLMs perform substantially worse than humans in sign language understanding.

02

Models show systematic disparities across input modalities and manual articulatory forms.

03

Performance limitations persist beyond reasoning improvements, with variable robustness to instructions.

Abstract

Sign language research has achieved significant progress due to the advances in large language models (LLMs). However, the intrinsic ability of LLMs to understand sign language, especially in multimodal contexts, remains underexplored. To address this limitation, we introduce CNSL-bench, the first comprehensive Chinese em{National Sign Language benchmark designed for evaluating multimodal large language models (MLLMs) in sign language understanding. The proposed CNSL-bench is characterized by: 1) Authoritative grounding, as it is anchored to the officially standardized \textit{National Common Sign Language Dictionary, mitigating ambiguity from regional or non-canonical variants and ensuring consistent semantic definitions; 2) Multimodal coverage, providing aligned textual descriptions, illustrative images, and sign language videos; and 3) Articulatory diversity, supporting fine-grained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.