Bidirectional Decoding: Improving Action Chunking via Guided Test-Time Sampling

Yuejiang Liu; Jubayer Ibn Hamid; Annie Xie; Yoonho Lee; Maximilian Du; Chelsea Finn

arXiv:2408.17355·cs.RO·April 28, 2025

TL;DR

This paper introduces Bidirectional Decoding, a test-time inference method that enhances action chunking in robot learning by balancing temporal consistency and reactivity, improving policy performance in various tasks.

Contribution

The paper proposes Bidirectional Decoding, a novel test-time sampling algorithm that improves action chunking by integrating backward and forward criteria for better policy adaptation.

Findings

01. Bidirectional Decoding (BID) improves performance across multiple benchmarks.

02. Action chunking captures temporal dependencies but reduces reactivity.

03. BID balances long-term consistency with short-term reactivity.

Abstract

Predicting and executing a sequence of actions without intermediate replanning, known as action chunking, is increasingly used in robot learning from human demonstrations. Yet, its effects on the learned policy remain inconsistent: some studies find it crucial for achieving strong results, while others observe decreased performance. In this paper, we first dissect how action chunking impacts the divergence between a learner and a demonstrator. We find that action chunking allows the learner to better capture the temporal dependencies in demonstrations but at the cost of reduced reactivity to unexpected states. To address this tradeoff, we propose Bidirectional Decoding (BID), a test-time inference algorithm that bridges action chunking with closed-loop adaptation. At each timestep, BID samples multiple candidate predictions and searches for the optimal one based on two criteria: (i)…
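
The sample-and-select loop described in the abstract can be sketched roughly as follows. This is a toy illustration under stated assumptions, not the authors' implementation: the abstract is cut off before it states the two selection criteria, so `overlap_score` is only a stand-in chosen for illustration. The names `select_chunk`, `overlap_score`, and `run_closed_loop`, the scalar action space, and the two-mode toy policy are all hypothetical.

```python
import random
from typing import Callable, List, Optional

Chunk = List[float]  # toy setting: a chunk is a short list of scalar actions


def select_chunk(
    sample_chunk: Callable[[float], Chunk],
    score: Callable[[Chunk, Optional[Chunk]], float],
    state: float,
    previous: Optional[Chunk],
    num_samples: int,
) -> Chunk:
    """Draw num_samples candidate chunks and return the highest-scoring one."""
    candidates = [sample_chunk(state) for _ in range(num_samples)]
    return max(candidates, key=lambda c: score(c, previous))


def overlap_score(candidate: Chunk, previous: Optional[Chunk]) -> float:
    """Stand-in criterion (an assumption, not taken from the paper):
    prefer candidates whose leading actions match the not-yet-executed
    tail of the previously selected chunk."""
    if previous is None:
        return 0.0
    tail = previous[1:]  # one action has been executed since `previous` was chosen
    return -sum((a - b) ** 2 for a, b in zip(candidate, tail))


def run_closed_loop(steps: int, chunk_len: int, num_samples: int, seed: int = 0) -> List[float]:
    """Re-sample and re-select at every timestep, executing only the first action."""
    rng = random.Random(seed)

    def sample_chunk(state: float) -> Chunk:
        # Hypothetical bimodal policy: commits to moving left or right for a whole chunk.
        direction = rng.choice([-1.0, 1.0])
        return [direction] * chunk_len

    state, previous, executed = 0.0, None, []
    for _ in range(steps):
        chunk = select_chunk(sample_chunk, overlap_score, state, previous, num_samples)
        executed.append(chunk[0])  # closed-loop: act on the first action only
        state += chunk[0]
        previous = chunk
    return executed


if __name__ == "__main__":
    print(run_closed_loop(steps=10, chunk_len=4, num_samples=16))
```

With `num_samples=1` the toy policy may switch direction at any step. With more samples, the stand-in score tends to keep the executed actions consistent with the earlier choice while the policy is still queried at every timestep.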

Taxonomy

Topics: Advanced Memory and Neural Computing · Neural Networks and Reservoir Computing · Neural dynamics and brain function

Methods: ALIGN