Extraction of linearized models from pre-trained networks via knowledge distillation

Fumito Kimura; Jun Ohkubo

arXiv:2604.06732·cs.LG·April 9, 2026

Extraction of linearized models from pre-trained networks via knowledge distillation

Fumito Kimura, Jun Ohkubo

PDF

TL;DR

This paper introduces a method to derive linear models from pre-trained neural networks using Koopman operator theory and knowledge distillation, improving classification accuracy and stability on MNIST datasets.

Contribution

It presents a novel framework combining Koopman theory and knowledge distillation to extract linearized models from neural networks for classification.

Findings

01

The proposed model outperforms least-squares Koopman approximation in accuracy.

02

It demonstrates improved numerical stability over traditional methods.

03

Effective on MNIST and Fashion-MNIST datasets.

Abstract

Recent developments in hardware, such as photonic integrated circuits and optical devices, are driving demand for research on constructing machine learning architectures tailored for linear operations. Hence, it is valuable to explore methods for constructing learning machines with only linear operations after simple nonlinear preprocessing. In this study, we propose a framework to extract a linearized model from a pre-trained neural network for classification tasks by integrating Koopman operator theory with knowledge distillation. Numerical demonstrations on the MNIST and the Fashion-MNIST datasets reveal that the proposed model consistently outperforms the conventional least-squares-based Koopman approximation in both classification accuracy and numerical stability.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.