Auto-Compressing Networks

Vaggelis Dorovatas; Georgios Paraskevopoulos; Alexandros Potamianos

arXiv:2506.09714·cs.LG·November 10, 2025

Auto-Compressing Networks

Vaggelis Dorovatas, Georgios Paraskevopoulos, Alexandros Potamianos

PDF

Open Access 1 Video

TL;DR

Auto-Compressing Networks (ACNs) introduce a novel architecture with long feedforward connections that enable the network to organically compress information during training, improving efficiency, robustness, and continual learning capabilities.

Contribution

This paper proposes ACNs, a new neural network architecture that replaces residual connections with long feedforward links, enabling automatic information compression and improved performance.

Findings

01

ACNs achieve up to 30-80% architectural compression.

02

ACNs reduce catastrophic forgetting by up to 18%.

03

ACNs demonstrate enhanced robustness and transfer learning capabilities.

Abstract

Deep neural networks with short residual connections have demonstrated remarkable success across domains, but increasing depth often introduces computational redundancy without corresponding improvements in representation quality. We introduce Auto-Compressing Networks (ACNs), an architectural variant where additive long feedforward connections from each layer to the output replace traditional short residual connections. By analyzing the distinct dynamics induced by this modification, we reveal a unique property we coin as auto-compression, the ability of a network to organically compress information during training with gradient descent, through architectural design alone. Through auto-compression, information is dynamically "pushed" into early layers during training, enhancing their representational quality and revealing potential redundancy in deeper ones. We theoretically show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Auto-Compressing Networks· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Attention Is All You Need · WordPiece · Multi-Head Attention · Softmax · Layer Normalization · Adam · Linear Warmup With Linear Decay · Weight Decay