TL;DR
This paper introduces SeAl-KD, a novel knowledge distillation method for spiking neural networks that selectively aligns knowledge at important timesteps, improving performance over existing methods.
Contribution
The paper proposes a selective alignment approach for knowledge distillation in SNNs, addressing the issue of uniform treatment of all timesteps and enhancing distillation effectiveness.
Findings
SeAl-KD improves SNN performance on various datasets.
Selective alignment reduces erroneous timestep influence.
The method outperforms existing distillation techniques.
Abstract
Spiking neural networks (SNNs), which are brain-inspired and spike-driven, achieve high energy efficiency. However, a performance gap between SNNs and artificial neural networks (ANNs) still remains. Knowledge distillation (KD) is commonly adopted to improve SNN performance, but existing methods typically enforce uniform alignment across all timesteps, either from a teacher network or through inter-temporal self-distillation, implicitly assuming that per-timestep predictions should be treated equally. In practice, SNN predictions vary and evolve over time, and intermediate timesteps need not all be individually correct even when the final aggregated output is correct. Under such conditions, effective distillation should not force every timestep toward the same supervision target, but instead provide corrective guidance to erroneous timesteps while preserving useful temporal dynamics. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
