GAIA: A Transfer Learning System of Object Detection that Fits Your Needs
Xingyuan Bu, Junran Peng, Junjie Yan, Tieniu Tan, Zhaoxiang Zhang

TL;DR
GAIA is a transfer learning system for object detection that efficiently produces customized models tailored to specific latency, data, and resource constraints, demonstrating strong results across multiple datasets.
Contribution
GAIA introduces an automated system that generates tailored object detection models for diverse downstream needs, addressing the challenge of large-scale pre-training costs.
Findings
Achieves AP from 38.2 to 46.5 on COCO
Produces models with latency from 16ms to 53ms
Effective across multiple datasets including COCO, Objects365, and others
Abstract
Transfer learning with pre-training on large-scale datasets has played an increasingly significant role in computer vision and natural language processing recently. However, as there exist numerous application scenarios that have distinctive demands such as certain latency constraints and specialized data distributions, it is prohibitively expensive to take advantage of large-scale pre-training for per-task requirements. In this paper, we focus on the area of object detection and present a transfer learning system named GAIA, which could automatically and efficiently give birth to customized solutions according to heterogeneous downstream needs. GAIA is capable of providing powerful pre-trained weights, selecting models that conform to downstream demands such as latency constraints and specified data domains, and collecting relevant data for practitioners who have very few datapoints…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
