Synthetic Adaptive Guided Embeddings (SAGE): A Novel Knowledge Distillation Method

Suleyman Olcay Polat; Poli A. Nemkova; Mark V. Albert

arXiv:2508.14783·cs.LG·August 21, 2025

Synthetic Adaptive Guided Embeddings (SAGE): A Novel Knowledge Distillation Method

Suleyman Olcay Polat, Poli A. Nemkova, Mark V. Albert

PDF

Open Access

TL;DR

SAGE introduces an adaptive knowledge distillation method that dynamically generates synthetic training data in high-loss regions, improving efficiency and performance of compact models in NLP tasks.

Contribution

The paper presents a novel adaptive distillation framework using UMAP-based augmentation and a lightweight interface for efficient knowledge transfer.

Findings

01

Achieves state-of-the-art results with fewer training epochs.

02

Matches or surpasses baseline performance on NLP benchmarks.

03

Reduces computational overhead in model distillation.

Abstract

Model distillation enables the transfer of knowledge from large-scale models to compact student models, facilitating deployment in resource-constrained environments. However, conventional distillation approaches often suffer from computational overhead and limited generalization. We propose a novel adaptive distillation framework that dynamically augments training data in regions of high student model loss. Using UMAP-based dimensionality reduction and nearest neighbor sampling, our method identifies underperforming regions in the embedding space and generates targeted synthetic examples to guide student learning. To further improve efficiency, we introduce a lightweight teacher-student interface that bypasses the teacher's input layer, enabling direct distillation on vectorized representations. Experiments across standard NLP benchmarks demonstrate that our 66M-parameter student model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation