Tasks Integrated Networks: Joint Detection and Retrieval for Image Search
Lei Zhang, Zhenwei He, Yi Yang, Liang Wang, Xinbo Gao

TL;DR
This paper introduces an end-to-end integrated network for image search that combines detection and retrieval tasks, addressing real-world scenarios where objects are not precisely annotated, and demonstrates superior performance over existing models.
Contribution
The paper proposes a novel DC-I-Net architecture with task-specific modules and a class-center guided loss, advancing image-level retrieval without bounding-box annotations.
Findings
Outperforms state-of-the-art image search models on benchmark datasets.
Introduces a dynamic feature dictionary with on-line pairing loss.
Employs a class-center guided HEP loss for improved robustness.
Abstract
The traditional object retrieval task aims to learn a discriminative feature representation with intra-similarity and inter-dissimilarity, which supposes that the objects in an image are manually or automatically pre-cropped exactly. However, in many real-world searching scenarios (e.g., video surveillance), the objects (e.g., persons, vehicles, etc.) are seldom accurately detected or annotated. Therefore, object-level retrieval becomes intractable without bounding-box annotation, which leads to a new but challenging topic, i.e. image-level search. In this paper, to address the image search issue, we first introduce an end-to-end Integrated Net (I-Net), which has three merits: 1) A Siamese architecture and an on-line pairing strategy for similar and dissimilar objects in the given images are designed. 2) A novel on-line pairing (OLP) loss is introduced with a dynamic feature dictionary,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods · Advanced Neural Network Applications
MethodsSoftmax
