TL;DR
This paper investigates the applicability of the Lottery Ticket Hypothesis to object recognition tasks, revealing challenges in transferability and providing methods to find highly sparse subnetworks without performance loss.
Contribution
First empirical study applying LTH to object detection, segmentation, and keypoint estimation, with insights on transferability and sparsity in these tasks.
Findings
Lottery tickets from ImageNet do not transfer well to downstream tasks.
Up to 80% sparsity achievable without performance drop.
Guidelines for finding sparse subnetworks across different object recognition sub-tasks.
Abstract
Recognition tasks, such as object recognition and keypoint estimation, have seen widespread adoption in recent years. Most state-of-the-art methods for these tasks use deep networks that are computationally expensive and have huge memory footprints. This makes it exceedingly difficult to deploy these systems on low power embedded devices. Hence, the importance of decreasing the storage requirements and the amount of computation in such models is paramount. The recently proposed Lottery Ticket Hypothesis (LTH) states that deep neural networks trained on large datasets contain smaller subnetworks that achieve on par performance as the dense networks. In this work, we perform the first empirical study investigating LTH for model pruning in the context of object detection, instance segmentation, and keypoint estimation. Our studies reveal that lottery tickets obtained from ImageNet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning
