Stitchable Neural Networks
Zizheng Pan, Jianfei Cai, Bohan Zhuang

TL;DR
SN-Net is a scalable framework that creates adaptable neural networks by stitching together pretrained models, enabling efficient deployment with dynamic accuracy and resource trade-offs.
Contribution
This work introduces SN-Net, a novel method for assembling pretrained models into a single adaptable network through stitching, supporting flexible deployment scenarios.
Findings
SN-Net achieves comparable or better performance than individual models.
It enables instant adaptation to resource constraints at runtime.
Supports diverse models like Swin Transformers for efficient deployment.
Abstract
The public model zoo containing enormous powerful pretrained model families (e.g., ResNet/DeiT) has reached an unprecedented scope than ever, which significantly contributes to the success of deep learning. As each model family consists of pretrained models with diverse scales (e.g., DeiT-Ti/S/B), it naturally arises a fundamental question of how to efficiently assemble these readily available models in a family for dynamic accuracy-efficiency trade-offs at runtime. To this end, we present Stitchable Neural Networks (SN-Net), a novel scalable and efficient framework for model deployment. It cheaply produces numerous networks with different complexity and performance trade-offs given a family of pretrained neural networks, which we call anchors. Specifically, SN-Net splits the anchors across the blocks/layers and then stitches them together with simple stitching layers to map the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · COVID-19 diagnosis using AI · Human Pose and Action Recognition
