PTSBench: A Comprehensive Post-Training Sparsity Benchmark Towards Algorithms and Models
Zining Wnag, Jinyang Guo, Ruihao Gong, Yang Yong, Aishan Liu, Yushi, Huang, Jiaheng Liu, Xianglong Liu

TL;DR
This paper introduces PTSBench, a comprehensive benchmark for evaluating post-training sparsity algorithms and models, providing new insights, evaluations, and an open-source framework to advance research in model efficiency.
Contribution
It presents the first extensive benchmark for post-training sparsity, evaluating over 40 models and 10 techniques across multiple tasks, offering valuable insights and a reusable framework.
Findings
New observations on PTS algorithms' effectiveness.
In-depth evaluation of models' sparsification ability.
Open-source framework for future research.
Abstract
With the increased attention to model efficiency, post-training sparsity (PTS) has become more and more prevalent because of its effectiveness and efficiency. However, there remain questions on better practice of PTS algorithms and the sparsification ability of models, which hinders the further development of this area. Therefore, a benchmark to comprehensively investigate the issues above is urgently needed. In this paper, we propose the first comprehensive post-training sparsity benchmark called PTSBench towards algorithms and models. We benchmark 10+ PTS general-pluggable fine-grained techniques on 3 typical tasks using over 40 off-the-shelf model architectures. Through extensive experiments and analyses, we obtain valuable conclusions and provide several insights from both algorithms and model aspects. Our PTSBench can provide (1) new observations for a better understanding of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReservoir Engineering and Simulation Methods · Advanced Data Processing Techniques · Graphite, nuclear technology, radiation studies
MethodsSoftmax · Attention Is All You Need
