LSDA: Large Scale Detection Through Adaptation
Judy Hoffman, Sergio Guadarrama, Eric Tzeng, Ronghang Hu, Jeff, Donahue, Ross Girshick, Trevor Darrell, and Kate Saenko

TL;DR
This paper introduces LSDA, a method that leverages large-scale classification data to enable detection for many categories lacking bounding box annotations, significantly expanding detection capabilities.
Contribution
The paper presents a novel adaptation approach that transfers knowledge from classification to detection, enabling large-scale detection without extensive bounding box annotations.
Findings
Successfully trained a detector for over 7,600 categories.
Demonstrated the effectiveness of transfer learning from classification to detection.
Achieved a detection speed of 2 frames per second for large-scale detectors.
Abstract
A major challenge in scaling object detection is the difficulty of obtaining labeled images for large numbers of categories. Recently, deep convolutional neural networks (CNNs) have emerged as clear winners on object classification benchmarks, in part due to training with 1.2M+ labeled classification images. Unfortunately, only a small fraction of those labels are available for the detection task. It is much cheaper and easier to collect large quantities of image-level labels from search engines than it is to collect detection data and label it with precise bounding boxes. In this paper, we propose Large Scale Detection through Adaptation (LSDA), an algorithm which learns the difference between the two tasks and transfers this knowledge to classifiers for categories without bounding box annotated data, turning them into detectors. Our method has the potential to enable detection for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
