Correct-By-Construction: Certified Individual Fairness through Neural Network Training
Ruihan Zhang, Jun Sun

TL;DR
This paper introduces a novel training framework that guarantees individual fairness in neural networks by combining fair initialization and fairness-preserving training, using randomized response mechanisms for efficiency and formal guarantees.
Contribution
It presents a new method that ensures individual fairness throughout training with formal guarantees, combining fair start and fairness-preserving updates using randomized mechanisms.
Findings
Produces empirically fair and accurate models
More efficient than verification-based certified training
Guarantees individual fairness during entire training process
Abstract
Fairness in machine learning is more important than ever as ethical concerns continue to grow. Individual fairness demands that individuals differing only in sensitive attributes receive the same outcomes. However, commonly used machine learning algorithms often fail to achieve such fairness. To improve individual fairness, various training methods have been developed, such as incorporating fairness constraints as optimisation objectives. While these methods have demonstrated empirical effectiveness, they lack formal guarantees of fairness. Existing approaches that aim to provide fairness guarantees primarily rely on verification techniques, which can sometimes fail to produce definitive results. Moreover, verification alone does not actively enhance individual fairness during training. To address this limitation, we propose a novel framework that formally guarantees individual fairness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
