Video Class Agnostic Segmentation with Contrastive Learning for Autonomous Driving
Mennatullah Siam, Alex Kendall, Martin Jagersand

TL;DR
This paper introduces a contrastive learning approach for video class agnostic segmentation in autonomous driving, improving segmentation of both known and unknown objects, especially in small datasets.
Contribution
It proposes a novel pixel-wise contrastive loss leveraging temporal guidance and releases a synthetic dataset for unknown objects in autonomous driving.
Findings
Contrastive loss improves segmentation of unknown objects.
Method is more effective on small datasets.
Synthetic dataset includes rare unknown objects.
Abstract
Semantic segmentation in autonomous driving predominantly focuses on learning from large-scale data with a closed set of known classes without considering unknown objects. Motivated by safety reasons, we address the video class agnostic segmentation task, which considers unknown objects outside the closed set of known classes in our training data. We propose a novel auxiliary contrastive loss to learn the segmentation of known classes and unknown objects. Unlike previous work in contrastive learning that samples the anchor, positive and negative examples on an image level, our contrastive learning method leverages pixel-wise semantic and temporal guidance. We conduct experiments on Cityscapes-VPS by withholding four classes from training and show an improvement gain for both known and unknown objects segmentation with the auxiliary contrastive loss. We further release a large-scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsContrastive Learning
