Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
Yueru Jia, Jiaming Liu, Sixiang Chen, Chenyang Gu, Zhilue Wang,, Longzan Luo, Lily Lee, Pengwei Wang, Zhongyuan Wang, Renrui Zhang, Shanghang, Zhang

TL;DR
Lift3D introduces a novel framework that enhances 2D large-scale pretrained models with 3D representations, enabling robust robotic manipulation by combining implicit and explicit 3D features.
Contribution
The paper presents a new method to adapt 2D foundation models for 3D robotic tasks through a task-aware autoencoder and a model-lifting strategy, addressing data scarcity and spatial information loss.
Findings
Lift3D outperforms previous methods in simulation benchmarks.
It demonstrates strong real-world manipulation capabilities.
The approach effectively integrates 3D information into 2D models.
Abstract
3D geometric information is essential for manipulation tasks, as robots need to perceive the 3D environment, reason about spatial relationships, and interact with intricate spatial configurations. Recent research has increasingly focused on the explicit extraction of 3D features, while still facing challenges such as the lack of large-scale robotic 3D data and the potential loss of spatial geometry. To address these limitations, we propose the Lift3D framework, which progressively enhances 2D foundation models with implicit and explicit 3D robotic representations to construct a robust 3D manipulation policy. Specifically, we first design a task-aware masked autoencoder that masks task-relevant affordance patches and reconstructs depth information, enhancing the 2D foundation model's implicit 3D robotic representation. After self-supervised fine-tuning, we introduce a 2D model-lifting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Manufacturing Process and Optimization · Robotic Mechanisms and Dynamics
