Aligning Large Language Models with Procedural Rules: An Autoregressive State-Tracking Prompting for In-Game Trading
Minkyung Kim, Junsik Kim, Woongcheol Yang, Sangdon Park, Sohee Bae

TL;DR
This paper introduces Autoregressive State-Tracking Prompting (ASTP), a method to improve large language models' adherence to procedural rules in in-game trading, ensuring trustworthiness and efficiency.
Contribution
The paper presents ASTP, a novel prompting technique that explicitly tracks and verifies state in LLMs, enhancing procedural compliance in game trading scenarios.
Findings
>99% state compliance achieved
99.3% calculation precision demonstrated
Smaller models match larger models' performance with faster response times
Abstract
Large Language Models (LLMs) enable dynamic game interactions but fail to follow essential procedural flows in rule-governed trading systems, eroding player trust. This work resolves the core tension between the creative flexibility of LLMs and the procedural demands of in-game trading (browse-offer-review-confirm). To this end, Autoregressive State-Tracking Prompting (ASTP) is introduced, a methodology centered on a strategically orchestrated prompt that compels an LLM to make its state-tracking process explicit and verifiable. Instead of relying on implicit contextual understanding, ASTP tasks the LLM with identifying and reporting a predefined state label from the previous turn. To ensure transactional integrity, this is complemented by a state-specific placeholder post-processing method for accurate price calculations. Evaluation across 300 trading dialogues demonstrates >99% state…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
