FairViT: Fair Vision Transformer via Adaptive Masking

Bowei Tian; Ruijie Du; Yanning Shen

arXiv:2407.14799 · cs.CV · July 23, 2024


TL;DR

FairViT is a Vision Transformer framework that combines adaptive fairness-aware masks on attention layers with a new distance loss, aiming to improve accuracy without sacrificing fairness.

Contribution

The paper introduces FairViT, a new fair and accurate Vision Transformer framework utilizing adaptive masks and a distance loss to improve fairness and accuracy simultaneously.

Findings

01 FairViT outperforms existing methods in accuracy.

02 FairViT achieves significant fairness improvements.

03 The approach maintains computational efficiency.

Abstract

Vision Transformer (ViT) has achieved excellent performance and demonstrated promising potential in various computer vision tasks. The wide deployment of ViT in real-world tasks requires a thorough understanding of the societal impact of the model. However, most ViT-based works do not take fairness into account, and it is unclear whether CNN-oriented debiasing algorithms can be applied directly to ViT. Moreover, previous works typically sacrifice accuracy for fairness. Therefore, we aim to develop an algorithm that improves accuracy without sacrificing fairness. In this paper, we propose FairViT, a novel accurate and fair ViT framework. To this end, we introduce a novel distance loss and deploy adaptive fairness-aware masks on attention layers, which are updated together with the model parameters. Experimental results show that FairViT can achieve better accuracy than other alternatives, even with competitive…
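
The abstract says masks are deployed on attention layers but does not say how they enter the computation. As an illustration only, the sketch below assumes a non-negative mask that is multiplied elementwise into the softmax attention weights and then renormalized. The function name `masked_attention`, the mask shape, and the renormalization step are assumptions made for this example and are not taken from the paper or its repository. In a real model the mask would be a trainable parameter; here it is a fixed list so the example runs with the standard library alone.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def masked_attention(q, k, v, mask):
    """Single-head scaled dot-product attention with an elementwise mask.

    q, k, v: lists of token vectors (n tokens, each a list of floats).
    mask:    n x n list of non-negative floats (hypothetical stand-in for a
             learnable mask); each row needs at least one positive entry.
    """
    dim = len(q[0])
    outputs = []
    for i in range(len(q)):
        # Scaled dot-product scores of query i against every key.
        scores = [
            sum(a * b for a, b in zip(q[i], k[j])) / math.sqrt(dim)
            for j in range(len(k))
        ]
        weights = softmax(scores)
        # Assumed masking rule: gate each attention weight, then renormalize
        # so the row is a probability distribution again.
        gated = [w * m for w, m in zip(weights, mask[i])]
        total = sum(gated)
        gated = [g / total for g in gated]
        # Weighted sum of the value vectors.
        outputs.append([
            sum(gated[j] * v[j][c] for j in range(len(v)))
            for c in range(len(v[0]))
        ])
    return outputs


q = k = [[1.0, 0.0], [0.0, 1.0]]
v = [[1.0, 2.0], [3.0, 4.0]]
# Mask that suppresses attention to the second token entirely.
out = masked_attention(q, k, v, [[1.0, 0.0], [1.0, 0.0]])
```

With the second column of the mask set to zero, every query can only attend to the first token, so each output row equals the first value vector. An all-ones mask leaves ordinary attention unchanged.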


Code & Models

Repositories

abdd68/Fair-Vision-Transformer (PyTorch, official)


Taxonomy

Topics: Infrared Target Detection Methodologies · Optical Polarization and Ellipsometry · Advanced Optical Imaging Technologies

Methods: Attention Is All You Need · Byte Pair Encoding · Layer Normalization · Label Smoothing · Linear Layer · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Multi-Head Attention · Dense Connections