Building Decision Making Models Through Language Model Regime
Yu Zhang, Haoxiang Liu, Feijun Jiang, Weihua Luo, Kaifu Zhang

TL;DR
This paper introduces the LTU approach that leverages large language models for decision making, combining broad pre-training with targeted fine-tuning to improve generalization across diverse tasks.
Contribution
The paper presents the first practical training architecture for decision making with LLMs, enabling both single-step and multi-step tasks beyond traditional domains.
Findings
LTU outperforms supervised learning in decision making tasks
Effective in e-commerce domains like advertising and search optimization
Provides a versatile framework applicable beyond game and robot domains
Abstract
We propose a novel approach for decision making problems leveraging the generalization capabilities of large language models (LLMs). Traditional methods such as expert systems, planning algorithms, and reinforcement learning often exhibit limited generalization, typically requiring the training of new models for each unique task. In contrast, LLMs demonstrate remarkable success in generalizing across varied language tasks, inspiring a new strategy for training decision making models. Our approach, referred to as "Learning then Using" (LTU), entails a two-stage process. Initially, the \textit{learning} phase develops a robust foundational decision making model by integrating diverse knowledge from various domains and decision making contexts. The subsequent \textit{using} phase refines this foundation model for specific decision making scenarios. Distinct from other studies that employ…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
