SignAgent: Agentic LLMs for Linguistically-Grounded Sign Language Annotation and Dataset Curation

Oliver Cory; Ozge Mercanoglu Sincan; Richard Bowden

arXiv:2603.19059·cs.CV·March 20, 2026

SignAgent: Agentic LLMs for Linguistically-Grounded Sign Language Annotation and Dataset Curation

Oliver Cory, Ozge Mercanoglu Sincan, Richard Bowden

PDF

Open Access

TL;DR

SignAgent leverages large language models to automate and improve the linguistic annotation and dataset curation process for Sign Language, addressing limitations of traditional methods and manual efforts.

Contribution

The paper introduces SignAgent, a novel agentic framework combining reasoning and knowledge-grounded LLMs for scalable, linguistically-aware Sign Language annotation and dataset curation.

Findings

01

Effective in pseudo-gloss annotation with multi-modal evidence

02

Accurately detects and refines visual clusters for ID glossing

03

Achieves strong performance in large-scale linguistic annotation

Abstract

This paper introduces SignAgent, a novel agentic framework that utilises Large Language Models (LLMs) for scalable, linguistically-grounded Sign Language (SL) annotation and dataset curation. Traditional computational methods for SLs often operate at the gloss level, overlooking crucial linguistic nuances, while manual linguistic annotation remains a significant bottleneck, proving too slow and expensive for the creation of large-scale, phonologically-aware datasets. SignAgent addresses these challenges through SignAgent Orchestrator, a reasoning LLM that coordinates a suite of linguistic tools, and SignGraph, a knowledge-grounded LLM that provides lexical and linguistic grounding. We evaluate our framework on two downstream annotation tasks. First, on Pseudo-gloss Annotation, where the agent performs constrained assignment, using multi-modal evidence to extract and order suitable gloss…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHand Gesture Recognition Systems · Hearing Impairment and Communication · Interactive and Immersive Displays