Auto-Compressing Networks
Vaggelis Dorovatas, Georgios Paraskevopoulos, Alexandros Potamianos

TL;DR
Auto-Compressing Networks (ACNs) introduce a novel architecture with long feedforward connections that enable the network to organically compress information during training, improving efficiency, robustness, and continual learning capabilities.
Contribution
This paper proposes ACNs, a new neural network architecture that replaces residual connections with long feedforward links, enabling automatic information compression and improved performance.
Findings
ACNs achieve up to 30-80% architectural compression.
ACNs reduce catastrophic forgetting by up to 18%.
ACNs demonstrate enhanced robustness and transfer learning capabilities.
Abstract
Deep neural networks with short residual connections have demonstrated remarkable success across domains, but increasing depth often introduces computational redundancy without corresponding improvements in representation quality. We introduce Auto-Compressing Networks (ACNs), an architectural variant where additive long feedforward connections from each layer to the output replace traditional short residual connections. By analyzing the distinct dynamics induced by this modification, we reveal a unique property we coin as auto-compression, the ability of a network to organically compress information during training with gradient descent, through architectural design alone. Through auto-compression, information is dynamically "pushed" into early layers during training, enhancing their representational quality and revealing potential redundancy in deeper ones. We theoretically show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Attention Is All You Need · WordPiece · Multi-Head Attention · Softmax · Layer Normalization · Adam · Linear Warmup With Linear Decay · Weight Decay
