Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang, Ofir Nachum, Yilun Du, Jason Wei, Pieter Abbeel, Dale, Schuurmans

TL;DR
This paper reviews how foundation models are evolving to support decision making in complex, real-world applications by integrating interaction, reasoning, and planning capabilities across diverse domains.
Contribution
It provides a comprehensive overview of recent methods and challenges in applying foundation models to decision making tasks across multiple disciplines.
Findings
Foundation models are increasingly used for decision making in real-world environments.
Recent approaches include prompting, generative modeling, planning, and reinforcement learning.
Open problems involve grounding models in practical decision contexts.
Abstract
Foundation models pretrained on diverse data at scale have demonstrated extraordinary capabilities in a wide range of vision and language tasks. When such models are deployed in real world environments, they inevitably interface with other entities and agents. For example, language models are often used to interact with human beings through dialogue, and visual perception models are used to autonomously navigate neighborhood streets. In response to these developments, new paradigms are emerging for training foundation models to interact with other agents and perform long-term reasoning. These paradigms leverage the existence of ever-larger datasets curated for multimodal, multitask, and generalist interaction. Research at the intersection of foundation models and decision making holds tremendous promise for creating powerful new systems that can interact effectively across a diverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Human-Automation Interaction and Safety · Explainable Artificial Intelligence (XAI)
