Advancing Fairness in Natural Language Processing: From Traditional Methods to Explainability
Fanny Jourdan

TL;DR
This thesis advances fairness in NLP by developing new bias mitigation algorithms, analyzing dataset biases, and introducing explainability methods like COCKATIEL and TaCo to enhance transparency and equity.
Contribution
It introduces novel algorithms for bias mitigation, explores dataset bias impacts, and proposes explainability techniques that connect fairness and interpretability in NLP models.
Findings
New bias mitigation algorithm outperforms traditional methods.
Dataset size influences bias and fairness metrics.
COCKATIEL effectively explains Transformer model decisions.
Abstract
The burgeoning field of Natural Language Processing (NLP) stands at a critical juncture where the integration of fairness within its frameworks has become an imperative. This PhD thesis addresses the need for equity and transparency in NLP systems, recognizing that fairness in NLP is not merely a technical challenge but a moral and ethical necessity, requiring a rigorous examination of how these technologies interact with and impact diverse human populations. Through this lens, this thesis undertakes a thorough investigation into the development of equitable NLP methodologies and the evaluation of biases that prevail in current systems. First, it introduces an innovative algorithm to mitigate biases in multi-class classifiers, tailored for high-risk NLP applications, surpassing traditional methods in both bias mitigation and prediction accuracy. Then, an analysis of the Bios dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI
MethodsAttention Is All You Need · Dropout · Layer Normalization · Adam · Dense Connections · Residual Connection · Position-Wise Feed-Forward Layer · Linear Layer · Byte Pair Encoding · Absolute Position Encodings
