PSALM-V: Automating Symbolic Planning in Interactive Visual Environments with Large Language Models
Wang Bill Zhu, Miaosen Chai, Ishika Singh, Robin Jia, Jesse Thomason

TL;DR
PSALM-V is an autonomous neuro-symbolic system that uses large language models to induce symbolic action semantics in visual environments, significantly improving planning success rates without expert input.
Contribution
It introduces PSALM-V, the first system to dynamically infer symbolic action semantics in visual environments through interaction and iterative refinement using LLMs.
Findings
Increases plan success rate from 37% to 74% in ALFRED.
Improves step efficiency in 2D game environments.
Successfully induces PDDL semantics for real-world robot tasks.
Abstract
We propose PSALM-V, the first autonomous neuro-symbolic learning system able to induce symbolic action semantics (i.e., pre- and post-conditions) in visual environments through interaction. PSALM-V bootstraps reliable symbolic planning without expert action definitions, using LLMs to generate heuristic plans and candidate symbolic semantics. Previous work has explored using large language models to generate action semantics for Planning Domain Definition Language (PDDL)-based symbolic planners. However, these approaches have primarily focused on text-based domains or relied on unrealistic assumptions, such as access to a predefined problem file, full observability, or explicit error messages. By contrast, PSALM-V dynamically infers PDDL problem files and domain action semantics by analyzing execution outcomes and synthesizing possible error explanations. The system iteratively generates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Natural Language Processing Techniques · AI-based Problem Solving and Planning
