Unifying Deep Predicate Invention with Pre-trained Foundation Models

Qianwei Wang; Bowen Li; Zhanpeng Luo; Yifan Xu; Alexander Gray; Tom Silver; Sebastian Scherer; Katia Sycara; Yaqi Xie

arXiv:2512.17992·cs.RO·December 23, 2025

Unifying Deep Predicate Invention with Pre-trained Foundation Models

Qianwei Wang, Bowen Li, Zhanpeng Luo, Yifan Xu, Alexander Gray, Tom Silver, Sebastian Scherer, Katia Sycara, Yaqi Xie

PDF

Open Access

TL;DR

UniPred is a novel bilevel learning framework that combines large language models and neural predicate learning to improve robotic task understanding and execution in complex environments.

Contribution

It unifies top-down and bottom-up predicate learning approaches using LLMs and visual features, enabling scalable and robust symbolic world models for robotics.

Findings

01

Achieves 2-4 times higher success rates than top-down methods.

02

Learns 3-4 times faster than bottom-up approaches.

03

Effective in both simulated and real-robot domains.

Abstract

Long-horizon robotic tasks are hard due to continuous state-action spaces and sparse feedback. Symbolic world models help by decomposing tasks into discrete predicates that capture object properties and relations. Existing methods learn predicates either top-down, by prompting foundation models without data grounding, or bottom-up, from demonstrations without high-level priors. We introduce UniPred, a bilevel learning framework that unifies both. UniPred uses large language models (LLMs) to propose predicate effect distributions that supervise neural predicate learning from low-level data, while learned feedback iteratively refines the LLM hypotheses. Leveraging strong visual foundation model features, UniPred learns robust predicate classifiers in cluttered scenes. We further propose a predicate evaluation method that supports symbolic models beyond STRIPS assumptions. Across five…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis