Learning Transferable Reward for Query Object Localization with Policy   Adaptation

Tingfeng Li; Shaobo Han; Martin Renqiang Min; Dimitris N. Metaxas

arXiv:2202.12403·cs.CV·March 16, 2022

Learning Transferable Reward for Query Object Localization with Policy Adaptation

Tingfeng Li, Shaobo Han, Martin Renqiang Min, Dimitris N. Metaxas

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a reinforcement learning method for query object localization that uses a transferable reward signal, enabling effective policy adaptation to new environments and classes without extensive retraining.

Contribution

The paper presents a novel transferable reward formulation using ordinal metric learning, allowing test-time policy adaptation and class transfer in object localization tasks.

Findings

01

Outperforms fine-tuning on various datasets

02

Enables class transfer without retraining

03

Effective in corrupted and diverse datasets

Abstract

We propose a reinforcement learning based approach to query object localization, for which an agent is trained to localize objects of interest specified by a small exemplary set. We learn a transferable reward signal formulated using the exemplary set by ordinal metric learning. Our proposed method enables test-time policy adaptation to new environments where the reward signals are not readily available, and outperforms fine-tuning approaches that are limited to annotated images. In addition, the transferable reward allows repurposing the trained agent from one specific class to another class. Experiments on corrupted MNIST, CU-Birds, and COCO datasets demonstrate the effectiveness of our approach.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

litingfeng/localization-by-ordembed
pytorchOfficial

Videos

Learning Transferable Reward for Query Object Localization with Policy Adaptation· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications