AKD : Adversarial Knowledge Distillation For Large Language Models Alignment on Coding tasks
Ilyas Oulkadda, Julien Perez

TL;DR
This paper introduces AKD, an adversarial knowledge distillation method that improves the robustness, reliability, and efficiency of large language models for code generation by using synthetic adversarial datasets.
Contribution
The paper presents a novel adversarial knowledge distillation framework that enhances Code-LLMs' capabilities and robustness using synthetic adversarial data, addressing data scarcity and model scaling issues.
Findings
AKD improves model robustness and reliability.
Synthetic adversarial datasets enhance reasoning capabilities.
Smaller models retain performance comparable to larger ones.
Abstract
The widespread adoption of Large Language Models (LLMs) for code generation, exemplified by GitHub Copilot\footnote{A coding extension powered by a Code-LLM to assist in code completion tasks} surpassing a million users, highlights the transformative potential of these tools in improving developer productivity. However, this rapid growth also underscores critical concerns regarding the quality, safety, and reliability of the code they generate. As Code-LLMs evolve, they face significant challenges, including the diminishing returns of model scaling and the scarcity of new, high-quality training data. To address these issues, this paper introduces Adversarial Knowledge Distillation (AKD), a novel approach that leverages adversarially generated synthetic datasets to distill the capabilities of larger models into smaller, more efficient ones. By systematically stress-testing and refining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning in Materials Science · Topic Modeling
MethodsKnowledge Distillation
