Semi-Supervised Object Detection: A Survey on Progress from CNN to Transformer
Tahira Shehzadi, Ifza, Didier Stricker, Muhammad Zeshan Afzal

TL;DR
This survey reviews 27 recent advancements in semi-supervised object detection, highlighting progress from CNNs to Transformers, and discusses techniques, challenges, and future research directions in reducing reliance on labeled data.
Contribution
The paper provides a comprehensive overview of recent SSOD methods, comparing architectures and techniques from CNNs to Transformers, and identifies key challenges and future research directions.
Findings
Significant performance improvements in SSOD models.
Effective strategies for pseudo-labeling and data augmentation.
Transition from CNN-based to Transformer-based architectures.
Abstract
The impressive advancements in semi-supervised learning have driven researchers to explore its potential in object detection tasks within the field of computer vision. Semi-Supervised Object Detection (SSOD) leverages a combination of a small labeled dataset and a larger, unlabeled dataset. This approach effectively reduces the dependence on large labeled datasets, which are often expensive and time-consuming to obtain. Initially, SSOD models encountered challenges in effectively leveraging unlabeled data and managing noise in generated pseudo-labels for unlabeled data. However, numerous recent advancements have addressed these issues, resulting in substantial improvements in SSOD performance. This paper presents a comprehensive review of 27 cutting-edge developments in SSOD methodologies, from Convolutional Neural Networks (CNNs) to Transformers. We delve into the core components of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification
