LLM-Flax : Generalizable Robotic Task Planning via Neuro-Symbolic Approaches with Large Language Models
Seongmin Kim, Daegyu Lee

TL;DR
LLM-Flax is a three-stage neuro-symbolic framework that automates domain rule generation, failure recovery, and object importance scoring using large language models, significantly reducing manual effort in robotic task planning.
Contribution
It introduces a fully automated, LLM-driven approach for domain rule creation, failure handling, and object scoring, eliminating manual rule authoring and training data needs.
Findings
Achieves higher success rates than manual baselines across multiple benchmarks.
Successfully performs zero-shot object importance scoring without training data.
Identifies context-window limitations at scale as a key challenge for future work.
Abstract
Deploying a neuro-symbolic task planner on a new domain today requires significant manual effort: a domain expert must author relaxation and complementary rules, and hundreds of training problems must be solved to supervise a Graph Neural Network (GNN) object scorer. We propose LLM-Flax, a three-stage framework that eliminates all three sources of manual effort using a locally hosted LLM given only a PDDL domain file. Stage 1 automatically generates relaxation and complementary rules via structured prompting with format validation and self-correction. Stage 2 introduces LLM-guided failure recovery with a feasibility-gated budget policy that explicitly reserves API latency cost before each LLM call, preventing the downstream relaxation fallback from being starved. Stage 3 replaces the domain-trained GNN entirely with zero-shot LLM object importance scoring, requiring no training data. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
