TL;DR
This paper introduces a knowledge evolution approach for training deep neural networks on small datasets, improving accuracy and reducing inference costs by iteratively evolving network knowledge.
Contribution
The paper proposes a novel evolution-inspired training method that enhances performance and creates slim networks suitable for small datasets, integrating seamlessly with existing architectures.
Findings
Achieves 21% accuracy improvement over baseline
Reduces inference cost by 73%
State-of-the-art results on classification benchmarks
Abstract
Deep learning relies on the availability of a large corpus of data (labeled or unlabeled). Thus, one challenging unsettled question is: how to train a deep network on a relatively small dataset? To tackle this question, we propose an evolution-inspired training approach to boost performance on relatively small datasets. The knowledge evolution (KE) approach splits a deep network into two hypotheses: the fit-hypothesis and the reset-hypothesis. We iteratively evolve the knowledge inside the fit-hypothesis by perturbing the reset-hypothesis for multiple generations. This approach not only boosts performance, but also learns a slim network with a smaller inference cost. KE integrates seamlessly with both vanilla and residual convolutional networks. KE reduces both overfitting and the burden for data collection. We evaluate KE on various network architectures and loss functions. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
