Rocket Launching: A Universal and Efficient Framework for Training Well-performing Light Net
Guorui Zhou, Ying Fan, Runpeng Cui, Weijie Bian, Xiaoqiang Zhu, Kun, Gai

TL;DR
This paper introduces a 'rocket launching' framework where a complex booster network guides the training of a lightweight model, significantly improving its performance for real-time tasks under computational constraints.
Contribution
The paper proposes a novel training framework using a booster network to enhance lightweight model performance, with analysis of loss functions and a gradient blocking technique.
Findings
Lightweight models achieve performance comparable to complex models.
The approach improves real-time prediction accuracy.
Effective on benchmark and industrial datasets.
Abstract
Models applied on real time response task, like click-through rate (CTR) prediction model, require high accuracy and rigorous response time. Therefore, top-performing deep models of high depth and complexity are not well suited for these applications with the limitations on the inference time. In order to further improve the neural networks' performance given the time and computational limitations, we propose an approach that exploits a cumbersome net to help train the lightweight net for prediction. We dub the whole process rocket launching, where the cumbersome booster net is used to guide the learning of the target light net throughout the whole training process. We analyze different loss functions aiming at pushing the light net to behave similarly to the booster net, and adopt the loss with best performance in our experiments. We use one technique called gradient block to improve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning in Materials Science · Advanced Neural Network Applications
