Learning to View: Decision Transformers for Active Object Detection

Wenhao Ding; Nathalie Majcherczyk; Mohit Deshpande; Xuewei Qi; Ding; Zhao; Rajasimman Madhivanan; Arnie Sen

arXiv:2301.09544·cs.RO·January 24, 2023

Learning to View: Decision Transformers for Active Object Detection

Wenhao Ding, Nathalie Majcherczyk, Mohit Deshpande, Xuewei Qi, Ding, Zhao, Rajasimman Madhivanan, Arnie Sen

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning approach using Decision Transformers for active object detection, enabling robots to plan views that improve detection quality through online fine-tuning.

Contribution

It presents a novel application of Decision Transformers with online fine-tuning for active perception, enhancing detection performance in robotic systems.

Findings

01

Outperforms baseline methods including expert policies and offline RL.

02

Demonstrates improved detection quality through active view planning.

03

Provides detailed analysis of reward and observation spaces.

Abstract

Active perception describes a broad class of techniques that couple planning and perception systems to move the robot in a way to give the robot more information about the environment. In most robotic systems, perception is typically independent of motion planning. For example, traditional object detection is passive: it operates only on the images it receives. However, we have a chance to improve the results if we allow planning to consume detection signals and move the robot to collect views that maximize the quality of the results. In this paper, we use reinforcement learning (RL) methods to control the robot in order to obtain images that maximize the detection quality. Specifically, we propose using a Decision Transformer with online fine-tuning, which first optimizes the policy with a pre-collected expert dataset and then improves the learned policy by exploring better solutions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Robotic Path Planning Algorithms

MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Adam · Position-Wise Feed-Forward Layer · Softmax · Linear Layer · Absolute Position Encodings · Dropout · Label Smoothing