RAFT -- A Domain Adaptation Framework for RGB & LiDAR Semantic Segmentation
Edward Humes, Xiaomin Lin, Boxun Hu, Rithvik Jonna, Tinoosh Mohsenin

TL;DR
RAFT is a new domain adaptation framework that improves RGB and LiDAR semantic segmentation performance in real-world scenarios by using minimal labeled data, data augmentation, and active learning, outperforming previous methods.
Contribution
RAFT introduces a novel domain adaptation approach combining data and feature augmentations with active learning for semantic segmentation.
Findings
Outperforms previous state-of-the-art on synthetic-to-real benchmarks
Achieves significant mIoU improvements in multiple domain adaptation scenarios
Demonstrates effectiveness of minimal labeled data and augmentation strategies
Abstract
Image segmentation is a powerful computer vision technique for scene understanding. However, real-world deployment is stymied by the need for high-quality, meticulously labeled datasets. Synthetic data provides high-quality labels while reducing the need for manual data collection and annotation. However, deep neural networks trained on synthetic data often face the Syn2Real problem, leading to poor performance in real-world deployments. To mitigate the aforementioned gap in image segmentation, we propose RAFT, a novel framework for adapting image segmentation models using minimal labeled real-world data through data and feature augmentations, as well as active learning. To validate RAFT, we perform experiments on the synthetic-to-real "SYNTHIA->Cityscapes" and "GTAV->Cityscapes" benchmarks. We managed to surpass the previous state of the art, HALO. SYNTHIA->Cityscapes experiences an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
