A Metamorphic Testing Perspective on Knowledge Distillation for Language Models of Code: Does the Student Deeply Mimic the Teacher?
Md. Abdul Awal, Mrigank Rochan, Chanchal K. Roy

TL;DR
This paper introduces MetaCompress, a metamorphic testing framework to evaluate behavioral fidelity in knowledge-distilled language models of code, revealing significant discrepancies not captured by accuracy metrics.
Contribution
It presents a novel metamorphic testing approach for assessing deep behavioral mimicry in student models, highlighting limitations of traditional evaluation methods.
Findings
Student models often fail to mimic teacher behavior, leading to up to 285% performance drops under adversarial attacks.
MetaCompress detects up to 62% behavioral discrepancies in student models.
Behavioral fidelity evaluation is crucial for effective knowledge distillation of language models.
Abstract
Transformer-based language models of code have achieved state-of-the-art performance across a wide range of software analytics tasks, but their practical deployment remains limited due to high computational costs, slow inference speeds, and significant environmental impact. To address these challenges, recent research has increasingly explored knowledge distillation as a method for compressing a large language model of code (the teacher) into a smaller model (the student) while maintaining performance. However, the degree to which a student model deeply mimics the predictive behavior and internal representations of its teacher remains largely unexplored, as current accuracy-based evaluation provides only a surface-level view of model quality and often fails to capture more profound discrepancies in behavioral fidelity between the teacher and student models. To address this gap, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
