ATG-MoE: Autoregressive trajectory generation with mixture-of-experts for assembly skill learning
Weihang Huang, Chaoran Zhang, Xiaoxin Deng, Hao Zhou, Zhaobo Xu, Shubo Cui, Long Zeng

TL;DR
ATG-MoE is an end-to-end autoregressive model with mixture-of-experts architecture that learns multiple assembly skills from demonstrations by integrating multi-modal inputs for robust and flexible robotic manipulation.
Contribution
It introduces a unified model that combines multi-modal feature fusion, autoregressive trajectory generation, and mixture-of-experts for efficient multi-skill learning in robotic assembly tasks.
Findings
Achieves 96.3% grasp success rate in simulation
Attains 91.8% overall success rate in assembly tasks
Demonstrates strong generalization and multi-skill integration
Abstract
Flexible manufacturing requires robot systems that can adapt to constantly changing tasks, objects, and environments. However, traditional robot programming is labor-intensive and inflexible, while existing learning-based assembly methods often suffer from weak positional generalization, complex multi-stage designs, and limited multi-skill integration capability. To address these issues, this paper proposes ATG-MoE, an end-to-end autoregressive trajectory generation method with mixture of experts for assembly skill learning from demonstration. The proposed method establishes a closed-loop mapping from multi-modal inputs, including RGB-D observations, natural language instructions, and robot proprioception to manipulation trajectories. It integrates multi-modal feature fusion for scene and task understanding, autoregressive sequence modeling for temporally coherent trajectory generation,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Robotic Path Planning Algorithms · Social Robot Interaction and HRI
