Self Distillation via Iterative Constructive Perturbations

Maheak Dave; Aniket Kumar Singh; Aryan Pareek; Harshita Jha; Debasis Chaudhuri; Manish Pratap Singh

arXiv:2505.14751·cs.LG·May 22, 2025

Self Distillation via Iterative Constructive Perturbations

Maheak Dave, Aniket Kumar Singh, Aryan Pareek, Harshita Jha, Debasis Chaudhuri, Manish Pratap Singh

PDF

Open Access

TL;DR

This paper introduces a novel cyclic optimization framework using Iterative Constructive Perturbation (ICP) to enhance neural network training by jointly optimizing model parameters and input data, improving performance and generalization.

Contribution

It proposes ICP within a self-distillation framework, enabling iterative input perturbations that refine representations and improve neural network training beyond traditional methods.

Findings

01

Significant performance improvements across various training scenarios

02

Effective mitigation of overfitting and generalization gaps

03

Enhanced intermediate feature representations

Abstract

Deep Neural Networks have achieved remarkable achievements across various domains, however balancing performance and generalization still remains a challenge while training these networks. In this paper, we propose a novel framework that uses a cyclic optimization strategy to concurrently optimize the model and its input data for better training, rethinking the traditional training paradigm. Central to our approach is Iterative Constructive Perturbation (ICP), which leverages the model's loss to iteratively perturb the input, progressively constructing an enhanced representation over some refinement steps. This ICP input is then fed back into the model to produce improved intermediate features, which serve as a target in a self-distillation framework against the original features. By alternately altering the model's parameters to the data and the data to the model, our method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsProcess Optimization and Integration