Teaching the Teacher: The Role of Teacher-Student Smoothness Alignment in Genetic Programming-based Symbolic Distillation

Soumyadeep Dhar; Kei Sen Fong; Mehul Motani

arXiv:2507.22767·cs.LG·April 15, 2026

Teaching the Teacher: The Role of Teacher-Student Smoothness Alignment in Genetic Programming-based Symbolic Distillation

Soumyadeep Dhar, Kei Sen Fong, Mehul Motani

PDF

TL;DR

This paper explores how aligning the functional smoothness of neural network teachers with symbolic students improves the accuracy of symbolic distillation, advancing explainable AI.

Contribution

It introduces a regularization framework that enforces teacher smoothness to enhance symbolic distillation accuracy, addressing a key gap in current methods.

Findings

01

Smoothness-regularized teachers lead to higher R^2 scores in students.

02

Regularization improves the alignment between teacher and student models.

03

Statistically significant accuracy improvements across 20 datasets.

Abstract

Obtaining human-readable symbolic formulas via genetic programming-based symbolic distillation of a deep neural network trained on the target dataset presents a promising yet underexplored path towards explainable artificial intelligence (XAI); however, the standard pipeline frequently yields symbolic models with poor predictive accuracy. We identify a fundamental misalignment in functional complexity as the primary barrier to achieving better accuracy: standard Artificial Neural Networks (ANNs) often learn accurate but highly irregular functions, while Symbolic Regression typically prioritizes parsimony, often resulting in a much simpler class of models that are unable to sufficiently distill or learn from the ANN teacher. To bridge this gap, we propose a framework that actively regularizes the teacher's functional smoothness using Jacobian and Lipschitz penalties, aiming to distill…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.