Spatial Ensemble: a Novel Model Smoothing Mechanism for Student-Teacher Framework
Tengteng Huang, Yifan Sun, Xun Wang, Haotian Yao, Chi Zhang

TL;DR
This paper introduces Spatial Ensemble, a new model smoothing technique for student-teacher frameworks that combines fragments of historical student models to improve learning performance and complements existing temporal smoothing methods.
Contribution
The paper proposes Spatial Ensemble, a novel model smoothing mechanism that stitches fragments of historical student models, enhancing student-teacher training effectiveness.
Findings
Spatial Ensemble achieves comparable performance to TMA.
Combining Spatial Ensemble with TMA improves results significantly.
On ImageNet, it improves BYOL accuracy by 0.9%. on CIFAR-10 with FixMatch, accuracy increases by 6%.
Abstract
Model smoothing is of central importance for obtaining a reliable teacher model in the student-teacher framework, where the teacher generates surrogate supervision signals to train the student. A popular model smoothing method is the Temporal Moving Average (TMA), which continuously averages the teacher parameters with the up-to-date student parameters. In this paper, we propose "Spatial Ensemble", a novel model smoothing mechanism in parallel with TMA. Spatial Ensemble randomly picks up a small fragment of the student model to directly replace the corresponding fragment of the teacher model. Consequentially, it stitches different fragments of historical student models into a unity, yielding the "Spatial Ensemble" effect. Spatial Ensemble obtains comparable student-teacher learning performance by itself and demonstrates valuable complementarity with temporal moving average. Their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
MethodsFixMatch · Bootstrap Your Own Latent
