Puppet-CNN: Continuous Parameter Dynamics for Input-Adaptive Convolutional Networks
Yucheng Xing, Xin Wang

TL;DR
Puppet-CNN models convolutional layer parameters as a continuous dynamical system using neural ODEs, enabling input-adaptive computation and reducing parameter storage while maintaining competitive accuracy.
Contribution
Introduces Puppet-CNN, a novel framework that represents CNN parameters as states evolving via neural ODEs, allowing adaptive depth and parameter efficiency.
Findings
Achieves competitive accuracy on image classification benchmarks.
Reduces the number of stored trainable parameters significantly.
Enables input-dependent adaptation of network complexity.
Abstract
Modern convolutional neural networks (CNNs) organize computation as a discrete stack of layers whose parameters are independently stored and learned, with the number of layers fixed as an architectural hyperparameter. In this work, we explore an alternative perspective: can network parameterization itself be modeled as a continuous dynamical system? We introduce Puppet-CNN, a framework that represents convolutional layer parameters as states evolving along a learned parameter flow governed by a neural ordinary differential equation (ODE). Under this formulation, layer parameters are generated through continuous evolution in parameter space, and the effective number of generated layers is determined by the integration horizon of the learned dynamics, which can be modulated by input complexity to enable input-adaptive computation. We validate this formulation on standard image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks
MethodsSparse Evolutionary Training · ADaptive gradient method with the OPTimal convergence rate
