Lan-grasp: Using Large Language Models for Semantic Object Grasping and Placement
Reihaneh Mirjalili, Michael Krawez, Yannik Blei, Simone Silenzi, Florian Walter, Wolfram Burgard

TL;DR
Lan-grasp introduces a zero-shot, semantic-aware robotic grasping method leveraging foundation models, enabling safer, more meaningful object manipulation without additional training, demonstrated through real-world experiments and user surveys.
Contribution
The paper presents a novel zero-shot semantic grasping approach using foundation models, improving safety and effectiveness in robotic object manipulation.
Findings
Higher participant-ranked grasps compared to baseline methods
Effective zero-shot grasping across diverse objects
Enhanced safety with the proposed placement strategy
Abstract
In this paper, we propose Lan-grasp, a novel approach towards more appropriate semantic grasping and placing. We leverage foundation models to equip the robot with a semantic understanding of object geometry, enabling it to identify the right place to grasp, which parts to avoid, and the natural pose for placement. This is an important contribution to grasping and utilizing objects in a more meaningful and safe manner. We leverage a combination of a Large Language Model, a Vision-Language Model, and a traditional grasp planner to generate grasps that demonstrate a deeper semantic understanding of the objects. Building on foundation models provides us with a zero-shot grasp method that can handle a wide range of objects without requiring further training or fine-tuning. We also propose a method for safely putting down a grasped object. The core idea is to rotate the object upright…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robot Manipulation and Learning · Robotics and Automated Systems
