Relation Networks for Object Detection

Han Hu; Jiayuan Gu; Zheng Zhang; Jifeng Dai; Yichen Wei

arXiv:1711.11575·cs.CV·June 15, 2018·67 cites

Relation Networks for Object Detection

Han Hu, Jiayuan Gu, Zheng Zhang, Jifeng Dai, Yichen Wei

PDF

Open Access 5 Repos

TL;DR

This paper introduces a lightweight object relation module that models interactions between objects in deep learning-based detection, improving recognition and enabling the first fully end-to-end object detector.

Contribution

It proposes a novel, lightweight, and easy-to-embed relation module that models object interactions, enhancing CNN-based detection without extra supervision.

Findings

01

Improves object recognition accuracy

02

Enhances duplicate removal in detection pipelines

03

Enables fully end-to-end object detection

Abstract

Although it is well believed for years that modeling relations between objects would help object recognition, there has not been evidence that the idea is working in the deep learning era. All state-of-the-art object detection systems still rely on recognizing object instances individually, without exploiting their relations during learning. This work proposes an object relation module. It processes a set of objects simultaneously through interaction between their appearance feature and geometry, thus allowing modeling of their relations. It is lightweight and in-place. It does not require additional supervision and is easy to embed in existing networks. It is shown effective on improving object recognition and duplicate removal steps in the modern object detection pipeline. It verifies the efficacy of modeling object relations in CNN based detection. It gives rise to the first fully…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications