Target Aware Network Architecture Search and Compression for Efficient Knowledge Transfer
S.H.Shabbeer Basha, Debapriya Tula, Sravan Kumar Vinakota, Shiv Ram, Dubey

TL;DR
TASCNet is a two-stage framework that automatically configures and prunes CNNs for efficient transfer learning, reducing complexity while maintaining performance across vision and sentiment analysis tasks.
Contribution
It introduces a novel two-stage process for automatic network configuration and pruning, optimizing CNN structures for target tasks in transfer learning.
Findings
Reduces CNN parameters and FLOPs significantly.
Maintains high accuracy after pruning.
Effective across multiple datasets and tasks.
Abstract
Transfer Learning enables Convolutional Neural Networks (CNN) to acquire knowledge from a source domain and transfer it to a target domain, where collecting large-scale annotated examples is time-consuming and expensive. Conventionally, while transferring the knowledge learned from one task to another task, the deeper layers of a pre-trained CNN are finetuned over the target dataset. However, these layers are originally designed for the source task which may be over-parameterized for the target task. Thus, finetuning these layers over the target dataset may affect the generalization ability of the CNN due to high network complexity. To tackle this problem, we propose a two-stage framework called TASCNet which enables efficient knowledge transfer. In the first stage, the configuration of the deeper layers is learned automatically and finetuned over the target dataset. Later, in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Efficient Wireless Sensor Networks
