Fair Active Learning: Solving the Labeling Problem in Insurance

Romuald Elie; Caroline Hillairet; Fran\c{c}ois Hu; Marc Juillard

arXiv:2112.09466·stat.ML·May 21, 2024

Fair Active Learning: Solving the Labeling Problem in Insurance

Romuald Elie, Caroline Hillairet, Fran\c{c}ois Hu, Marc Juillard

PDF

Open Access

TL;DR

This paper introduces a novel fair active learning method for insurance models that balances predictive accuracy and fairness, addressing bias and reducing labeling efforts in sensitive data contexts.

Contribution

It proposes an innovative sampling approach that combines informativeness and fairness, improving model fairness without sacrificing performance.

Findings

01

Effective balancing of fairness and accuracy demonstrated on insurance datasets

02

Sampling method reduces bias and enhances fairness in model inferences

03

Numerical experiments confirm improved fairness metrics

Abstract

This paper addresses significant obstacles that arise from the widespread use of machine learning models in the insurance industry, with a specific focus on promoting fairness. The initial challenge lies in effectively leveraging unlabeled data in insurance while reducing the labeling effort and emphasizing data relevance through active learning techniques. The paper explores various active learning sampling methodologies and evaluates their impact on both synthetic and real insurance datasets. This analysis highlights the difficulty of achieving fair model inferences, as machine learning models may replicate biases and discrimination found in the underlying data. To tackle these interconnected challenges, the paper introduces an innovative fair active learning method. The proposed approach samples informative and fair instances, achieving a good balance between model predictive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Advanced Causal Inference Techniques · Machine Learning in Healthcare