Trustworthy Representation Learning via Information Funnels and Bottlenecks

João Machado de Freitas; Bernhard C. Geiger

arXiv:2211.01446 · cs.LG · November 6, 2025 · 1 citation

Open Access

TL;DR

This paper introduces CPFSI, an information-theoretic framework for learning invariant, fair, and private representations, and reports that it works well on real-world data, especially in data-scarce tabular settings.

Contribution

The paper proposes the Conditional Privacy Funnel with Side-information (CPFSI), a new approach within information bottleneck methods, with neural approximations and analysis of trade-offs for fair, private, and invariant representations.

Findings

01 CPFSI effectively balances utility, fairness, and privacy.

02 Intervening on sensitive attributes improves fairness without sacrificing performance.

03 The method outperforms existing approaches on real-world tabular datasets.

Abstract

Ensuring trustworthiness in machine learning -- by balancing utility, fairness, and privacy -- remains a critical challenge, particularly in representation learning. In this work, we investigate a family of closely related information-theoretic objectives, including information funnels and bottlenecks, designed to extract invariant representations from data. We introduce the Conditional Privacy Funnel with Side-information (CPFSI), a novel formulation within this family, applicable in both fully and semi-supervised settings. Given the intractability of these objectives, we derive neural-network-based approximations via amortized variational inference. We systematically analyze the trade-offs between utility, invariance, and representation fidelity, offering new insights into the Pareto frontiers of these methods. Our results demonstrate that CPFSI effectively balances these competing…
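
The abstract mentions neural approximations obtained via amortized variational inference. As a rough illustration of one ingredient that commonly appears in bottleneck-style objectives, the sketch below computes the standard variational upper bound on I(X;Z) for a diagonal Gaussian encoder with a standard normal prior. This is a generic, textbook-style example and not the CPFSI objective from the paper; the function names, the Gaussian encoder, the standard normal prior, and the toy encoder outputs are all assumptions made for illustration.

```python
import numpy as np

def gaussian_kl_to_standard_normal(mu, log_var):
    """Per-example KL( N(mu, diag(exp(log_var))) || N(0, I) ) in nats."""
    mu = np.asarray(mu, dtype=float)
    log_var = np.asarray(log_var, dtype=float)
    # Closed form for diagonal Gaussians, summed over latent dimensions.
    return 0.5 * np.sum(np.exp(log_var) + mu**2 - 1.0 - log_var, axis=-1)

def rate_upper_bound(mu, log_var):
    """Batch average of the KL term.

    For any prior r(z), E_x[KL(q(z|x) || r(z))] = I(X;Z) + KL(q(z) || r(z)),
    so this average is a Monte Carlo estimate of an upper bound on I(X;Z).
    """
    return float(np.mean(gaussian_kl_to_standard_normal(mu, log_var)))

# Hypothetical encoder outputs: batch of 2 inputs, 3-dimensional latent.
mu = np.array([[0.0, 0.0, 0.0],
               [1.0, -1.0, 0.5]])
log_var = np.zeros((2, 3))  # unit variance in every dimension

bound = rate_upper_bound(mu, log_var)
print(f"estimated upper bound on I(X;Z): {bound:.4f} nats")
```

In a bottleneck-style objective, a term of this kind is typically weighted against a utility term (for example a prediction loss), and the weight traces out a trade-off curve between compression and utility. How CPFSI combines its terms is specified in the paper itself and is not reproduced here.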

Taxonomy

Topics: Machine Learning and Data Classification · Adversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data

Methods: Variational Inference