Aux-NAS: Exploiting Auxiliary Labels with Negligibly Extra Inference Cost
Yuan Gao, Weizhong Zhang, Wenhan Luo, Lin Ma, Jin-Gang Yu, Gui-Song, Xia, Jiayi Ma

TL;DR
Aux-NAS introduces a novel architecture-based NAS method that leverages auxiliary labels to improve primary task performance without increasing inference cost, by evolving networks with only primary-to-auxiliary links during inference.
Contribution
It proposes a NAS-based approach to exploit auxiliary labels through architecture design, enabling different training and inference networks with negligible extra inference cost.
Findings
Achieves improved primary task performance across six tasks.
Effectively removes auxiliary connections during inference.
Compatible with various backbones and auxiliary learning methods.
Abstract
We aim at exploiting additional auxiliary labels from an independent (auxiliary) task to boost the primary task performance which we focus on, while preserving a single task inference cost of the primary task. While most existing auxiliary learning methods are optimization-based relying on loss weights/gradients manipulation, our method is architecture-based with a flexible asymmetric structure for the primary and auxiliary tasks, which produces different networks for training and inference. Specifically, starting from two single task networks/branches (each representing a task), we propose a novel method with evolving networks where only primary-to-auxiliary links exist as the cross-task connections after convergence. These connections can be removed during the primary task inference, resulting in a single-task inference cost. We achieve this by formulating a Neural Architecture Search…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsCryptography and Data Security · Spam and Phishing Detection · Internet Traffic Analysis and Secure E-voting
MethodsAverage Pooling · Kaiming Initialization · Max Pooling · Global Average Pooling · Dropout · Softmax · Dense Connections · Convolution · Focus
