Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised   Object Detection

Yuxing Tang; Josiah Wang; Xiaofang Wang; Boyang Gao; Emmanuel; Dellandrea; Robert Gaizauskas; Liming Chen

arXiv:1801.03145·cs.CV·March 14, 2018

Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection

Yuxing Tang, Josiah Wang, Xiaofang Wang, Boyang Gao, Emmanuel, Dellandrea, Robert Gaizauskas, Liming Chen

PDF

TL;DR

This paper introduces a novel semi-supervised object detection method that leverages visual and semantic similarities between categories to improve knowledge transfer from image classifiers to object detectors, achieving state-of-the-art results.

Contribution

It incorporates object similarities from visual and semantic domains into the transfer process, enhancing detection performance in semi-supervised learning.

Findings

01

Visual and semantic similarities are complementary for knowledge transfer.

02

The proposed methods outperform baseline approaches.

03

Achieves state-of-the-art semi-supervised detection results.

Abstract

Deep CNN-based object detection systems have achieved remarkable success on several large-scale object detection benchmarks. However, training such detectors requires a large number of labeled bounding boxes, which are more difficult to obtain than image-level annotations. Previous work addresses this issue by transforming image-level classifiers into object detectors. This is done by modeling the differences between the two on categories with both image-level and bounding box annotations, and transferring this information to convert classifiers to detectors for categories without bounding box annotations. We improve this previous work by incorporating knowledge about object similarities from visual and semantic domains during the transfer process. The intuition behind our proposed method is that visually and semantically similar categories should exhibit more common transferable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.