Automata-Based Steering of Large Language Models for Diverse Structured Generation

Xiaokun Luan; Zeming Wei; Yihao Zhang; Meng Sun

arXiv:2511.11018·cs.CL·November 17, 2025

Open Access

TL;DR

This paper introduces an automaton-based method that uses automata traversal history to steer large language models toward more diverse structured outputs, addressing the low output diversity common in structured generation.

Contribution

We propose a novel automaton traversal approach that guides LLMs to produce more diverse structured outputs while keeping generation efficiency comparable.

Findings

01 Significant increase in structural diversity

02 Enhanced content variety in generated outputs

03 Maintained generation efficiency

Abstract

Large language models (LLMs) are increasingly tasked with generating structured outputs. While structured generation methods ensure validity, they often lack output diversity, a critical limitation that we confirm in our preliminary study. We propose a novel method to enhance diversity in automaton-based structured generation. Our approach utilizes automata traversal history to steer LLMs towards novel structural patterns. Evaluations show our method significantly improves structural and content diversity while maintaining comparable generation efficiency. Furthermore, we conduct a case study showcasing the effectiveness of our method in generating diverse test cases for testing open-source libraries.
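
To make the idea of steering by traversal history concrete, the following is a minimal toy sketch, not the authors' algorithm. It assumes a hand-written DFA over whole tokens, a fixed table of stand-in logits in place of a real LLM, greedy decoding, and a simple count-based penalty on transitions already taken in earlier generations. The names `DFA`, `BASE_LOGITS`, `decode_once`, `sample_many`, and the `penalty` parameter are all hypothetical and chosen for illustration only.

```python
from collections import Counter

# Hypothetical toy automaton: state -> {token: next_state}.
# It accepts the strings "[]", "[1]", '["x"]' and "[null]".
DFA = {
    "start": {"[": "open"},
    "open": {"]": "done", "1": "item", '"x"': "item", "null": "item"},
    "item": {"]": "done"},
    "done": {},  # accepting state with no outgoing transitions
}

# Stand-in for model logits (assumption: a real LLM would supply these
# per decoding step; here they are fixed so the example is deterministic).
BASE_LOGITS = {"[": 0.0, "]": 0.6, "1": 2.0, '"x"': 1.0, "null": 0.0}


def decode_once(visits, penalty):
    """Greedy constrained decoding with a penalty on already-used transitions."""
    state, out = "start", []
    while DFA[state]:
        allowed = DFA[state]  # only automaton-valid tokens are considered
        token = max(
            allowed,
            key=lambda t: BASE_LOGITS[t] - penalty * visits[(state, t)],
        )
        visits[(state, token)] += 1  # record traversal history
        out.append(token)
        state = allowed[token]
    return "".join(out)


def sample_many(n, penalty):
    """Generate n outputs that share one traversal history."""
    visits = Counter()
    return [decode_once(visits, penalty) for _ in range(n)]


print(sample_many(3, 0.0))  # no steering: ['[1]', '[1]', '[1]']
print(sample_many(3, 1.5))  # with steering: ['[1]', '["x"]', '[]']
```

Every output remains valid because tokens are only ever chosen from the automaton's allowed transitions; the penalty merely reorders preferences among valid tokens, which is why diversity can increase without giving up the validity guarantee. The penalty form and its value of 1.5 are arbitrary choices for this sketch.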

Taxonomy

Topics: Software Engineering Research · Topic Modeling · Natural Language Processing Techniques