MVB-Grasp: Minimum-Volume-Box Filtering of Diffusion-based Grasps for Frontal Manipulation
Bibek Poudel, Abdul Basit, Muhammad Shafique

TL;DR
This paper introduces MVB-Grasp, a geometric prior-based filtering method that significantly improves frontal grasp success rates for low-cost manipulators by integrating MVBB constraints into diffusion-based grasp generation.
Contribution
It proposes a novel MVBB-based geometric filter and a combined scoring function tailored for frontal workspace constraints, validated through systematic MuJoCo evaluation and real-world experiments.
Findings
MVB-Grasp achieves 59.3% success rate versus 24.7% for vanilla GraspGen.
Filtering infeasible candidates improves grasp reliability.
Real-world trials confirm substantial performance gains.
Abstract
State-of-the-art 6-DoF grasp generators excel on tabletop benchmarks with overhead cameras but struggle in frontal grasping scenarios on low-cost manipulators with constrained workspaces, where kinematic limits and approach-direction constraints cause high failure rates. We address this challenge for the Unitree Z1 arm by proposing MVB-Grasp, a novel grasping stack that injects a Minimum Volume Bounding Box (MVBB) geometric prior into diffusion-based grasp generation to dramatically improve success rates in frontal, workspace-constrained settings. Our key scientific contributions are threefold: (i) an MVBB-based geometric filter that exploits oriented bounding-box face normals to reject grasps approaching through the table or misaligned with accessible object faces in O(N) time; (ii) a combined re-scoring function that blends learned discriminator scores with face-alignment geometry…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
