Aligned Unsupervised Pretraining of Object Detectors with Self-training

Ioannis Maniadis Metaxas; Adrian Bulat; Ioannis Patras; Brais; Martinez; Georgios Tzimiropoulos

arXiv:2307.15697·cs.CV·July 9, 2024·1 cites

Aligned Unsupervised Pretraining of Object Detectors with Self-training

Ioannis Maniadis Metaxas, Adrian Bulat, Ioannis Patras, Brais, Martinez, Georgios Tzimiropoulos

PDF

Open Access

TL;DR

This paper introduces an aligned unsupervised pretraining framework for object detectors that uses richer proposals, class pseudo-labeling, and self-training, achieving state-of-the-art results and enabling pretraining from scratch on complex datasets.

Contribution

It proposes a novel unsupervised pretraining method that aligns with downstream detection tasks using high-level proposals, clustering, and self-training, simplifying the pipeline and improving performance.

Findings

01

Achieves state-of-the-art detection performance in low and full data regimes.

02

Enables pretraining from scratch on complex datasets like COCO.

03

Works across various detector architectures and datasets.

Abstract

The unsupervised pretraining of object detectors has recently become a key component of object detector training, as it leads to improved performance and faster convergence during the supervised fine-tuning stage. Existing unsupervised pretraining methods, however, typically rely on low-level information to define proposals that are used to train the detector. Furthermore, in the absence of class labels for these proposals, an auxiliary loss is used to add high-level semantics. This results in complex pipelines and a task gap between the pretraining and the downstream task. We propose a framework that mitigates this issue and consists of three simple yet key ingredients: (i) richer initial proposals that do encode high-level semantics, (ii) class pseudo-labeling through clustering, that enables pretraining using a standard object detection training pipeline, (iii) self-training to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · COVID-19 diagnosis using AI

MethodsAttention Is All You Need · Softmax · Position-Wise Feed-Forward Layer · Layer Normalization · Linear Layer · Dense Connections · Label Smoothing · Dropout · Adam · Absolute Position Encodings