Loading paper
Clutter-Robust Vision-Language-Action Models through Object-Centric and Geometry Grounding | Tomesphere