Heterogeneous Multi-Expert Reinforcement Learning for Long-Horizon Multi-Goal Tasks in Autonomous Forklifts
Yun Chen, Bowei Huang, Fan Guo, Kang Song

TL;DR
This paper introduces Heterogeneous Multi-Expert Reinforcement Learning (HMER), a framework that decomposes long-horizon tasks into specialized sub-policies for autonomous forklifts, improving efficiency and precision in warehouse operations.
Contribution
The paper presents a novel HMER framework that separates navigation and manipulation tasks with specialized experts and a semantic planner, enhancing performance in complex warehouse tasks.
Findings
Achieved 94.2% task success rate in simulation
Reduced operation time by 21.4% compared to baselines
Maintained placement error within 1.5 cm
Abstract
Autonomous mobile manipulation in unstructured warehouses requires a balance between efficient large-scale navigation and high-precision object interaction. Traditional end-to-end learning approaches often struggle to handle the conflicting demands of these distinct phases. Navigation relies on robust decision-making over large spaces, while manipulation needs high sensitivity to fine local details. Forcing a single network to learn these different objectives simultaneously often causes optimization interference, where improving one task degrades the other. To address these limitations, we propose a Heterogeneous Multi-Expert Reinforcement Learning (HMER) framework tailored for autonomous forklifts. HMER decomposes long-horizon tasks into specialized sub-policies controlled by a Semantic Task Planner. This structure separates macro-level navigation from micro-level manipulation,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Robot Manipulation and Learning · Reinforcement Learning in Robotics
