Hybrid Knowledge Routed Modules for Large-scale Object Detection

Chenhan Jiang; Hang Xu; Xiangdan Liang; Liang Lin

arXiv:1810.12681·cs.CV·October 31, 2018·58 cites

Hybrid Knowledge Routed Modules for Large-scale Object Detection

Chenhan Jiang, Hang Xu, Xiangdan Liang, Liang Lin

PDF

Open Access 1 Repo

TL;DR

This paper introduces Hybrid Knowledge Routed Modules (HKRM) that leverage explicit and implicit human knowledge to improve large-scale object detection, especially in long-tail and confusing categories, by enabling semantic reasoning.

Contribution

The paper proposes HKRM, a novel module that integrates structured and implicit knowledge for enhanced reasoning in object detection networks.

Findings

01

Achieved 34.5% mAP improvement on VisualGenome

02

Achieved 30.4% mAP improvement on ADE

03

Demonstrated effectiveness in large-scale, long-tail scenarios

Abstract

The dominant object detection approaches treat the recognition of each region separately and overlook crucial semantic correlations between objects in one scene. This paradigm leads to substantial performance drop when facing heavy long-tail problems, where very few samples are available for rare classes and plenty of confusing categories exists. We exploit diverse human commonsense knowledge for reasoning over large-scale object categories and reaching semantic coherency within one image. Particularly, we present Hybrid Knowledge Routed Modules (HKRM) that incorporates the reasoning routed by two kinds of knowledge forms: an explicit knowledge module for structured constraints that are summarized with linguistic knowledge (e.g. shared attributes, relationships) about concepts; and an implicit knowledge module that depicts some implicit constraints (e.g. common spatial layouts). By…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chanyn/HKRM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning