Localize to Classify and Classify to Localize: Mutual Guidance in Object   Detection

Heng Zhang; Elisa Fromont; S\'ebastien Lefevre; Bruno Avignon

arXiv:2009.14085·cs.CV·September 30, 2020·1 cites

Localize to Classify and Classify to Localize: Mutual Guidance in Object Detection

Heng Zhang, Elisa Fromont, S\'ebastien Lefevre, Bruno Avignon

PDF

Open Access 1 Repo

TL;DR

This paper introduces a mutual guidance strategy for object detection that dynamically improves anchor matching by jointly optimizing localization and classification tasks, leading to better performance on standard datasets.

Contribution

It proposes a novel anchor matching criterion that uses mutual guidance between localization and classification during training, enhancing detection accuracy.

Findings

01

Improved detection performance on PASCAL VOC and MS COCO datasets.

02

Demonstrated effectiveness across various deep learning architectures.

03

Showed generality and simplicity of the proposed method.

Abstract

Most deep learning object detectors are based on the anchor mechanism and resort to the Intersection over Union (IoU) between predefined anchor boxes and ground truth boxes to evaluate the matching quality between anchors and objects. In this paper, we question this use of IoU and propose a new anchor matching criterion guided, during the training phase, by the optimization of both the localization and the classification tasks: the predictions related to one task are used to dynamically assign sample anchors and improve the model on the other task, and vice versa. Despite the simplicity of the proposed method, our experiments with different state-of-the-art deep learning architectures on PASCAL VOC and MS COCO datasets demonstrate the effectiveness and generality of our Mutual Guidance strategy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ZHANGHeng19931123/MutualGuide
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning

MethodsMutual Guidance