Equivariant Volumetric Grasping

Pinhao Song; Yutong Hu; Pengteng Li; Renaud Detry

arXiv:2507.18847·cs.RO·May 12, 2026

Equivariant Volumetric Grasping

Pinhao Song, Yutong Hu, Pengteng Li, Renaud Detry

PDF

1 Repo

TL;DR

This paper introduces a rotation-equivariant volumetric grasp model using tri-plane features, improving sampling efficiency and outperforming non-equivariant methods in real-time robotic grasping tasks.

Contribution

It presents a novel tri-plane feature design with equivariance properties and adapts existing grasp planners to leverage these features for enhanced performance.

Findings

01

Significant improvement in sampling efficiency.

02

Higher grasping performance compared to non-equivariant models.

03

Reduced computational and memory costs.

Abstract

We propose a new volumetric grasp model that is equivariant to rotations around the vertical axis, leading to a significant improvement in sampling efficiency. Our model employs a tri-plane volumetric feature representation -- i.e., the projection of 3D features onto three canonical planes. We introduce a novel tri-plane feature design in which features on the horizontal plane are \emph{equivariant} to $9 0^{\circ}$ rotations, while the \emph{sum} of features from the other two planes remains \emph{invariant} to reflections induced by the same transformations. We further develop equivariant adaptations of two state-of-the-art volumetric grasp planners, GIGA and IGD. Specifically, we derive a new equivariant formulation of IGD's deformable attention mechanism and propose an equivariant generative model of grasp orientations based on flow matching. We provide a detailed analytical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mousecpn/evg-page
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.