Parametric Information Maximization for Generalized Category Discovery

Florent Chiaroni; Jose Dolz; Ziko Imtiaz Masud; Amar Mitiche; Ismail; Ben Ayed

arXiv:2212.00334·cs.CV·July 17, 2023·1 cites

Parametric Information Maximization for Generalized Category Discovery

Florent Chiaroni, Jose Dolz, Ziko Imtiaz Masud, Amar Mitiche, Ismail, Ben Ayed

PDF

Open Access 1 Repo

TL;DR

This paper presents a Parametric Information Maximization model for Generalized Category Discovery, effectively handling class imbalance and achieving state-of-the-art results across diverse datasets, including fine-grained categories.

Contribution

The paper introduces a bi-level optimization framework that explores a family of objective functions to improve GCD performance, especially on imbalanced datasets.

Findings

01

Consistently outperforms existing methods on six datasets.

02

Effectively handles both short-tailed and long-tailed data.

03

Achieves new state-of-the-art results in fine-grained GCD tasks.

Abstract

We introduce a Parametric Information Maximization (PIM) model for the Generalized Category Discovery (GCD) problem. Specifically, we propose a bi-level optimization formulation, which explores a parameterized family of objective functions, each evaluating a weighted mutual information between the features and the latent labels, subject to supervision constraints from the labeled samples. Our formulation mitigates the class-balance bias encoded in standard information maximization approaches, thereby handling effectively both short-tailed and long-tailed data sets. We report extensive experiments and comparisons demonstrating that our PIM model consistently sets new state-of-the-art performances in GCD across six different datasets, more so when dealing with challenging fine-grained problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fchiaroni/mutual-information-based-gcd
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Machine Learning and Data Classification