NVSPolicy: Adaptive Novel-View Synthesis for Generalizable Language-Conditioned Policy Learning
Le Shi, Yifei Shi, Xin Xu, Tenglong Liu, Junhua Xi, and Chengyuan Chen

TL;DR
NVSPolicy introduces an adaptive novel-view synthesis approach combined with hierarchical policy learning, significantly improving generalization and success rates in robot manipulation tasks across unseen environments.
Contribution
The paper proposes NVSPolicy, a novel method integrating adaptive view synthesis with a hierarchical policy network for improved generalization in language-conditioned robot policies.
Findings
Achieves 90.4% success rate on CALVIN benchmark
Outperforms recent methods in zero-shot generalization
Effective real-world robotic application demonstrated
Abstract
Recent advances in deep generative models demonstrate unprecedented zero-shot generalization capabilities, offering great potential for robot manipulation in unstructured environments. Given a partial observation of a scene, deep generative models could generate the unseen regions and therefore provide more context, which enhances the capability of robots to generalize across unseen environments. However, due to the visual artifacts in generated images and inefficient integration of multi-modal features in policy learning, this direction remains an open challenge. We introduce NVSPolicy, a generalizable language-conditioned policy learning method that couples an adaptive novel-view synthesis module with a hierarchical policy network. Given an input image, NVSPolicy dynamically selects an informative viewpoint and synthesizes an adaptive novel-view image to enrich the visual context. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsADaptive gradient method with the OPTimal convergence rate
