Building Decision Making Models Through Language Model Regime

Yu Zhang; Haoxiang Liu; Feijun Jiang; Weihua Luo; Kaifu Zhang

arXiv:2408.06087·cs.CL·August 13, 2024

Building Decision Making Models Through Language Model Regime

Yu Zhang, Haoxiang Liu, Feijun Jiang, Weihua Luo, Kaifu Zhang

PDF

Open Access

TL;DR

This paper introduces the LTU approach that leverages large language models for decision making, combining broad pre-training with targeted fine-tuning to improve generalization across diverse tasks.

Contribution

The paper presents the first practical training architecture for decision making with LLMs, enabling both single-step and multi-step tasks beyond traditional domains.

Findings

01

LTU outperforms supervised learning in decision making tasks

02

Effective in e-commerce domains like advertising and search optimization

03

Provides a versatile framework applicable beyond game and robot domains

Abstract

We propose a novel approach for decision making problems leveraging the generalization capabilities of large language models (LLMs). Traditional methods such as expert systems, planning algorithms, and reinforcement learning often exhibit limited generalization, typically requiring the training of new models for each unique task. In contrast, LLMs demonstrate remarkable success in generalizing across varied language tasks, inspiring a new strategy for training decision making models. Our approach, referred to as "Learning then Using" (LTU), entails a two-stage process. Initially, the \textit{learning} phase develops a robust foundational decision making model by integrating diverse knowledge from various domains and decision making contexts. The subsequent \textit{using} phase refines this foundation model for specific decision making scenarios. Distinct from other studies that employ…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques