Real-World Reinforcement Learning of Active Perception Behaviors

Edward S. Hu; Jie Wang; Xingfang Yuan; Fiona Luo; Muyao Li; Gaspard Lambrechts; Oleh Rybkin; Dinesh Jayaraman

arXiv:2512.01188·cs.RO·December 2, 2025

Real-World Reinforcement Learning of Active Perception Behaviors

Edward S. Hu, Jie Wang, Xingfang Yuan, Fiona Luo, Muyao Li, Gaspard Lambrechts, Oleh Rybkin, Dinesh Jayaraman

PDF

Open Access 1 Datasets 1 Video

TL;DR

This paper introduces AAWR, a simple and efficient method for training active perception policies in robots, leveraging privileged sensors during training to improve performance under partial observability.

Contribution

The paper presents AAWR, a novel training approach that uses privileged sensors and demonstrations to quickly learn active perception behaviors in robots.

Findings

01

AAWR outperforms prior methods on 8 manipulation tasks across 3 robots.

02

It enables robots to operate effectively under severe partial observability.

03

AAWR efficiently generates information-gathering behaviors from suboptimal initial policies.

Abstract

A robot's instantaneous sensory observations do not always reveal task-relevant state information. Under such partial observability, optimal behavior typically involves explicitly acting to gain the missing information. Today's standard robot learning techniques struggle to produce such active perception behaviors. We propose a simple real-world robot learning recipe to efficiently train active perception policies. Our approach, asymmetric advantage weighted regression (AAWR), exploits access to "privileged" extra sensors at training time. The privileged sensors enable training high-quality privileged value functions that aid in estimating the advantage of the target policy. Bootstrapping from a small number of potentially suboptimal demonstrations and an easy-to-obtain coarse policy initialization, AAWR quickly acquires active perception behaviors and boosts task performance. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Everloom/AAWR-DROID
dataset· 7.5k dl
7.5k dl

Videos

Real-World Reinforcement Learning of Active Perception Behaviors· slideslive

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Gaussian Processes and Bayesian Inference