Schema Key Wording as an Instruction Channel in Structured Generation under Constrained Decoding

Yifan Le

arXiv:2604.14862·cs.CL·April 29, 2026

Schema Key Wording as an Instruction Channel in Structured Generation under Constrained Decoding

Yifan Le

PDF

TL;DR

This paper investigates how schema key tokens serve as an implicit instruction channel in constrained decoding for structured generation, revealing their significant impact on model accuracy and behavior.

Contribution

It introduces a systematic study of schema keys as instruction channels, providing a theoretical analysis and empirical evidence of their influence across different models.

Findings

01

Schema key wording significantly affects accuracy in structured generation.

02

Qwen models benefit more from schema-level instructions, LLaMA models rely more on prompts.

03

Schema design is integral to instruction specification, not just output formatting.

Abstract

Constrained decoding is widely used to make large language models produce structured outputs that satisfy schemas such as JSON. Existing work mainly treats schemas as structural constraints, overlooking that schema-key tokens also enter the autoregressive context and may guide generation. To the best of our knowledge, we present the first systematic study of schema keys as an implicit instruction channel under constrained decoding. We formulate structured generation as a multi-channel instruction problem, where task signals can be placed in prompts, schema keys, or both. We further provide a projection-aware analysis: a CoT-style key helps only when its semantic gain exceeds the distortion induced by grammar-constrained projection, offering a theoretical explanation for model-dependent key effects. Experiments on mathematical reasoning benchmarks show that changing only schema-key…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.