TFS-ViT: Token-Level Feature Stylization for Domain Generalization
Mehrdad Noori, Milad Cheraghalikhani, Ali Bahri, Gustavo A. Vargas, Hakim, David Osowiechi, Ismail Ben Ayed, Christian Desrosiers

TL;DR
TFS-ViT introduces a token-level feature stylization method for Vision Transformers, enhancing their ability to generalize to unseen domains by synthesizing new domain styles through normalization statistic mixing, with attention-aware enhancements.
Contribution
This work is the first to propose token-level feature stylization for ViTs, improving domain generalization by synthesizing diverse domain styles using attention-guided normalization mixing.
Findings
Achieves state-of-the-art results on five domain generalization benchmarks.
Effectively handles various types of domain shifts.
Flexible and compatible with any ViT architecture.
Abstract
Standard deep learning models such as convolutional neural networks (CNNs) lack the ability of generalizing to domains which have not been seen during training. This problem is mainly due to the common but often wrong assumption of such models that the source and target data come from the same i.i.d. distribution. Recently, Vision Transformers (ViTs) have shown outstanding performance for a broad range of computer vision tasks. However, very few studies have investigated their ability to generalize to new domains. This paper presents a first Token-level Feature Stylization (TFS-ViT) approach for domain generalization, which improves the performance of ViTs to unseen data by synthesizing new domains. Our approach transforms token features by mixing the normalization statistics of images from different domains. We further improve this approach with a novel strategy for attention-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
