Hybrid Knowledge Routed Modules for Large-scale Object Detection
Chenhan Jiang, Hang Xu, Xiangdan Liang, Liang Lin

TL;DR
This paper introduces Hybrid Knowledge Routed Modules (HKRM) that leverage explicit and implicit human knowledge to improve large-scale object detection, especially in long-tail and confusing categories, by enabling semantic reasoning.
Contribution
The paper proposes HKRM, a novel module that integrates structured and implicit knowledge for enhanced reasoning in object detection networks.
Findings
Achieved 34.5% mAP improvement on VisualGenome
Achieved 30.4% mAP improvement on ADE
Demonstrated effectiveness in large-scale, long-tail scenarios
Abstract
The dominant object detection approaches treat the recognition of each region separately and overlook crucial semantic correlations between objects in one scene. This paradigm leads to substantial performance drop when facing heavy long-tail problems, where very few samples are available for rare classes and plenty of confusing categories exists. We exploit diverse human commonsense knowledge for reasoning over large-scale object categories and reaching semantic coherency within one image. Particularly, we present Hybrid Knowledge Routed Modules (HKRM) that incorporates the reasoning routed by two kinds of knowledge forms: an explicit knowledge module for structured constraints that are summarized with linguistic knowledge (e.g. shared attributes, relationships) about concepts; and an implicit knowledge module that depicts some implicit constraints (e.g. common spatial layouts). By…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
