MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the   Edge

Geng Yuan; Xiaolong Ma; Wei Niu; Zhengang Li; Zhenglun Kong; Ning Liu,; Yifan Gong; Zheng Zhan; Chaoyang He; Qing Jin; Siyue Wang; Minghai Qin; Bin; Ren; Yanzhi Wang; Sijia Liu; Xue Lin

arXiv:2110.14032·cs.LG·October 28, 2021·41 cites

MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge

Geng Yuan, Xiaolong Ma, Wei Niu, Zhengang Li, Zhenglun Kong, Ning Liu,, Yifan Gong, Zheng Zhan, Chaoyang He, Qing Jin, Siyue Wang, Minghai Qin, Bin, Ren, Yanzhi Wang, Sijia Liu, Xue Lin

PDF

Open Access 1 Repo 1 Video

TL;DR

MEST is a novel sparse training framework designed for edge devices that enhances accuracy and speed through elastic mutation, soft memory bounds, and data efficiency, outperforming state-of-the-art methods on ImageNet.

Contribution

This work introduces MEST, a new sparse training framework with innovative enhancements for accuracy and speed on edge devices, emphasizing the importance of sparsity schemes and data efficiency.

Findings

01

MEST significantly improves Top-1 accuracy on ImageNet.

02

MEST outperforms state-of-the-art sparse training methods.

03

Data efficiency accelerates sparse training by identifying removable examples.

Abstract

Recently, a new trend of exploring sparsity for accelerating neural network training has emerged, embracing the paradigm of training on the edge. This paper proposes a novel Memory-Economic Sparse Training (MEST) framework targeting for accurate and fast execution on edge devices. The proposed MEST framework consists of enhancements by Elastic Mutation (EM) and Soft Memory Bound (&S) that ensure superior accuracy at high sparsity ratios. Different from the existing works for sparse training, this current work reveals the importance of sparsity schemes on the performance of sparse training in terms of accuracy as well as training speed on real edge devices. On top of that, the paper proposes to employ data efficiency for further acceleration of sparse training. Our results suggest that unforgettable examples can be identified in-situ even during the dynamic exploration of sparsity masks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

boone891214/mest
pytorchOfficial

Videos

MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge· slideslive

Taxonomy

TopicsAdvanced Neural Network Applications · Stochastic Gradient Optimization Techniques · Machine Learning and ELM

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings