ATG-MoE: Autoregressive trajectory generation with mixture-of-experts for assembly skill learning

Weihang Huang; Chaoran Zhang; Xiaoxin Deng; Hao Zhou; Zhaobo Xu; Shubo Cui; Long Zeng

arXiv:2603.19029·cs.RO·March 20, 2026

ATG-MoE: Autoregressive trajectory generation with mixture-of-experts for assembly skill learning

Weihang Huang, Chaoran Zhang, Xiaoxin Deng, Hao Zhou, Zhaobo Xu, Shubo Cui, Long Zeng

PDF

Open Access 1 Datasets

TL;DR

ATG-MoE is an end-to-end autoregressive model with mixture-of-experts architecture that learns multiple assembly skills from demonstrations by integrating multi-modal inputs for robust and flexible robotic manipulation.

Contribution

It introduces a unified model that combines multi-modal feature fusion, autoregressive trajectory generation, and mixture-of-experts for efficient multi-skill learning in robotic assembly tasks.

Findings

01

Achieves 96.3% grasp success rate in simulation

02

Attains 91.8% overall success rate in assembly tasks

03

Demonstrates strong generalization and multi-skill integration

Abstract

Flexible manufacturing requires robot systems that can adapt to constantly changing tasks, objects, and environments. However, traditional robot programming is labor-intensive and inflexible, while existing learning-based assembly methods often suffer from weak positional generalization, complex multi-stage designs, and limited multi-skill integration capability. To address these issues, this paper proposes ATG-MoE, an end-to-end autoregressive trajectory generation method with mixture of experts for assembly skill learning from demonstration. The proposed method establishes a closed-loop mapping from multi-modal inputs, including RGB-D observations, natural language instructions, and robot proprioception to manipulation trajectories. It integrates multi-modal feature fusion for scene and task understanding, autoregressive sequence modeling for temporally coherent trajectory generation,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

failo0711/ATG-MoE_TrainingSet
dataset· 7.0k dl
7.0k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Robotic Path Planning Algorithms · Social Robot Interaction and HRI