Generative Hints
Andy Dimnaku, Abdullah Yusuf Kavranoglu, Yaser Abu-Mostafa

TL;DR
Generative hints is a semi-supervised training method that uses a generative model to produce virtual examples, explicitly enforcing known invariances to improve model accuracy over traditional data augmentation.
Contribution
It introduces a novel semi-supervised approach that directly constrains invariance properties using generative models and virtual examples, outperforming standard augmentation methods.
Findings
Achieved up to 2.10% accuracy improvement on fine-grained classification.
Gained an average of 1.29% accuracy on the CheXpert dataset.
Consistently outperformed standard data augmentation across multiple datasets.
Abstract
Data augmentation is widely used in vision to introduce variation and mitigate overfitting, by enabling models to learn invariant properties. However, augmentation only indirectly captures these properties and does not explicitly constrain the learned function to satisfy them beyond the empirical training set. We propose generative hints, a training methodology that directly enforces known functional invariances over the input distribution. Our approach leverages a generative model trained on the training data to approximate the input distribution and to produce unlabeled synthetic images, which we refer to as virtual examples. On these virtual examples, we impose hint objectives that explicitly constrain the model's predictions to satisfy known invariance properties, such as spatial invariance. Although the original training dataset is fully labeled, generative hints train the model in…
Peer Reviews
Decision·Submitted to ICLR 2026
The idea of creating "hints" or functions that capture diverse properties of the target distribution is strong and would help the community. However, the paper only presents invariance-based hints/transformations, which are already standard in the community.
The two main pillars of the presented paper are commonly existing ideas in the ML community, namely: generative (data) augmentation and data augmentation (for invariances). In my understanding, while the paper starts by planning to go beyond these ideas to present "other properties" (aka non-invariant properties like the authors specify for tabular data -- monotonicity), the proposed methodology and experiments do not go beyond existing knowledge to demonstrate any additional ideas or further an
1. Conceptually clear formulation of “hints.” The paper formalizes the notion of a hint as a constraint linking inputs that should yield similar outputs, providing a unifying view that bridges data augmentation, regularization, and semi-supervised learning. 2. This paper Novel use of generative models for invariance enforcement. Rather than simply augmenting labeled samples, the approach explicitly uses a pretrained generator (StyleGAN3) to sample the data manifold and apply transformations in
1. Limited performance gains. Reported improvements are relatively small (average ≈ +0.6 percentage points, maximum ≈ +1.8 points top-1 accuracy). While consistent, the benefits may not justify the additional computational overhead of training or maintaining a generative model. 2. Dependency on generator quality. The approach relies heavily on the fidelity of the generator. The paper notes that when FID > 11, generative hints become ineffective, restricting applicability to domains with strong
1. The paper proposes a new data augmentation method that leverages generative modeling. 2. The approach consistently improves classification performance across several benchmark datasets.
I find two major weaknesses in the current version of the paper. The first is that **the problem formulation contains significant ambiguity**, resulting in unclear or even contradictory statements throughout the paper. For example, in Section 3, Definition 3 fails to clearly define the generative model G. While I understand that the authors may refer to common generative models such as GANs or diffusion models, such assumptions should be explicitly stated for clarity and self-containment. Simila
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
