Semantic Curiosity for Active Visual Learning

Devendra Singh Chaplot; Helen Jiang; Saurabh Gupta; Abhinav Gupta

arXiv:2006.09367·cs.CV·June 17, 2020

Semantic Curiosity for Active Visual Learning

Devendra Singh Chaplot, Helen Jiang, Saurabh Gupta, Abhinav Gupta

PDF

TL;DR

This paper introduces a semantic curiosity-driven exploration policy for active visual learning, enabling an agent to efficiently select data for labeling to improve object detection in new environments.

Contribution

It proposes a self-supervised semantic curiosity approach for training exploration policies, reducing the need for extensive labeled data and improving generalization across scenes.

Findings

01

Semantic curiosity outperforms random exploration.

02

The approach generalizes well to unseen scenes.

03

It leads to better object detection performance.

Abstract

In this paper, we study the task of embodied interactive learning for object detection. Given a set of environments (and some labeling budget), our goal is to learn an object detector by having an agent select what data to obtain labels for. How should an exploration policy decide which trajectory should be labeled? One possibility is to use a trained object detector's failure cases as an external reward. However, this will require labeling millions of frames required for training RL policies, which is infeasible. Instead, we explore a self-supervised approach for training our exploration policy by introducing a notion of semantic curiosity. Our semantic curiosity policy is based on a simple observation -- the detection outputs should be consistent. Therefore, our semantic curiosity rewards trajectories with inconsistent labeling behavior and encourages the exploration policy to explore…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.