Lang2Lift: A Language-Guided Autonomous Forklift System for Outdoor Industrial Pallet Handling
Huy Hoang Nguyen, Johannes Huemer, Markus Murschitz, Tobias Glueck, Minh Nhat Vu, Andreas Kugi

TL;DR
Lang2Lift is an innovative autonomous forklift system that uses natural language instructions and perception modules to identify, locate, and pick up pallets in complex outdoor environments, enhancing flexibility and practicality.
Contribution
The paper introduces a novel language-guided perception and control system for outdoor forklifts, enabling flexible pallet handling in unstructured, real-world scenarios.
Findings
Successful deployment on outdoor forklift platform
Effective language-grounded perception for pallet identification
Accurate pose estimation for manipulation tasks
Abstract
Automating pallet handling in outdoor logistics and construction environments remains challenging due to unstructured scenes, variable pallet configurations, and changing environmental conditions. In this paper, we present Lang2Lift, an end-to-end language-guided autonomous forklift system designed to support practical pallet pick-up operations in real-world outdoor settings. The system enables operators to specify target pallets using natural language instructions, allowing flexible selection among multiple pallets with different loads and spatial arrangements. Lang2Lift integrates foundation-model-based perception modules with motion planning and control in a closed-loop autonomy pipeline. Language-grounded visual perception is used to identify and segment target pallets, followed by 6D pose estimation and geometric refinement to generate manipulation-feasible insertion poses. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning
