NBIAS: A Natural Language Processing Framework for Bias Identification in Text
Shaina Raza, Muskan Garg, Deepak John Reji, Syed Raza Bashir, Chen, Ding

TL;DR
This paper introduces NBIAS, a comprehensive NLP framework designed to detect biases in text across various domains, aiming to promote fair and ethical data usage through a transformer-based bias identification model.
Contribution
The paper presents a novel multi-layer framework with a transformer-based model for bias detection, including diverse datasets and a combined evaluation approach.
Findings
Accuracy improvements of 1% to 8% over baselines
Effective identification of bias words and phrases
Robust understanding of model functioning
Abstract
Bias in textual data can lead to skewed interpretations and outcomes when the data is used. These biases could perpetuate stereotypes, discrimination, or other forms of unfair treatment. An algorithm trained on biased data may end up making decisions that disproportionately impact a certain group of people. Therefore, it is crucial to detect and remove these biases to ensure the fair and ethical use of data. To this end, we develop a comprehensive and robust framework NBIAS that consists of four main layers: data, corpus construction, model development and an evaluation layer. The dataset is constructed by collecting diverse data from various domains, including social media, healthcare, and job hiring portals. As such, we applied a transformer-based token classification model that is able to identify bias words/ phrases through a unique named entity BIAS. In the evaluation procedure, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Hate Speech and Cyberbullying Detection · Names, Identity, and Discrimination Research
