A Deep Risk Estimator for Known Operator Learning
Andreas Maier, Md Hasan, Paulina Conrad, Paula Andrea Perez-Toro

TL;DR
This paper introduces a risk estimator for deep networks with both learned and known operators, linking expected error to training sample size and enabling better understanding of model complexity and data requirements.
Contribution
The authors derive a novel deep risk estimator that decomposes total risk based on known and learned layers, providing insights into sample complexity and model design.
Findings
The risk bound decreases when replacing learned layers with known operators.
Sample requirement scales with the number of trainable parameters in learned layers.
The estimator accurately predicts test error and training sample needs across CT reconstruction and physics-informed neural networks.
Abstract
We describe an approach for estimating the statistical risk of deep networks that contain a mix of learned and known operators. Building on the maximal training error bounds previously established for known operator learning, we derive a deep risk estimator that connects the expected error of a layered network to the size of the training sample. The estimator decomposes the total risk into a sum over learned layers; every known operator contributes zero to this sum, while every learned layer adds an approximation term inspired by Barron's classic work and an estimation term that decreases with the number of training samples. We are able to show that the bound shrinks whenever a learned layer is replaced by a known operator and that the corresponding sample requirement scales with the number of trainable parameters of the layer that is replaced. As an application, we use computed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
