Rein++: Efficient Generalization and Adaptation for Semantic Segmentation with Vision Foundation Models

Zhixiang Wei; Xiaoxiao Ma; Ruishen Yan; Tao Tu; Huaian Chen; Jinjin Zheng; Yi Jin; Enhong Chen

arXiv:2508.01667·cs.CV·August 5, 2025

Rein++: Efficient Generalization and Adaptation for Semantic Segmentation with Vision Foundation Models

Zhixiang Wei, Xiaoxiao Ma, Ruishen Yan, Tao Tu, Huaian Chen, Jinjin Zheng, Yi Jin, Enhong Chen

PDF

Open Access

TL;DR

Rein++ is a novel framework that enhances semantic segmentation by improving generalization from limited data and enabling effective adaptation to diverse, unlabeled real-world scenarios using vision foundation models.

Contribution

It introduces a parameter-efficient method with trainable tokens for better generalization and a dual-level unsupervised domain adaptation approach, including a semantic transfer module, to handle domain shifts.

Findings

01

Rein++ outperforms state-of-the-art methods in segmentation accuracy.

02

It requires less than 1% of backbone parameters for training.

03

The framework effectively adapts to diverse target domains without labels.

Abstract

Vision Foundation Models(VFMs) have achieved remarkable success in various computer vision tasks. However, their application to semantic segmentation is hindered by two significant challenges: (1) the disparity in data scale, as segmentation datasets are typically much smaller than those used for VFM pre-training, and (2) domain distribution shifts, where real-world segmentation scenarios are diverse and often underrepresented during pre-training. To overcome these limitations, we present Rein++, an efficient VFM-based segmentation framework that demonstrates superior generalization from limited data and enables effective adaptation to diverse unlabeled scenarios. Specifically, Rein++ comprises a domain generalization solution Rein-G and a domain adaptation solution Rein-A. Rein-G introduces a set of trainable, instance-aware tokens that effectively refine the VFM's features for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications