Towards an Optimal Control Perspective of ResNet Training

Jens P\"uttschneider; Simon Heilig; Asja Fischer; Timm Faulwasser

arXiv:2506.21453·cs.LG·June 27, 2025

Towards an Optimal Control Perspective of ResNet Training

Jens P\"uttschneider, Simon Heilig, Asja Fischer, Timm Faulwasser

PDF

Open Access

TL;DR

This paper formulates ResNet training as an optimal control problem, leading to a dynamic that encourages pruning of unnecessary layers, potentially improving efficiency and understanding of residual networks.

Contribution

It introduces an optimal control-based training formulation for ResNets, linking control theory with neural network training and suggesting a new layer pruning strategy.

Findings

01

Biases weights of unnecessary residual layers to vanish

02

Provides a theoretical framework connecting control theory and ResNet training

03

Suggests a new approach for layer pruning based on optimal control principles

Abstract

We propose a training formulation for ResNets reflecting an optimal control problem that is applicable for standard architectures and general loss functions. We suggest bridging both worlds via penalizing intermediate outputs of hidden states corresponding to stage cost terms in optimal control. For standard ResNets, we obtain intermediate outputs by propagating the state through the subsequent skip connections and the output layer. We demonstrate that our training dynamic biases the weights of the unnecessary deeper residual layers to vanish. This indicates the potential for a theory-grounded layer pruning strategy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsPruning