Learning Theory of Distribution Regression with Neural Networks
Zhongjie Shi, Zhan Yu, Ding-Xuan Zhou

TL;DR
This paper develops a theoretical framework for using neural networks to perform distribution regression, where inputs are probability measures, establishing approximation capabilities and learning rates.
Contribution
It introduces a novel neural network structure for distribution inputs and provides the first mathematical analysis of its approximation and learning properties.
Findings
Established a neural network framework for distribution regression.
Proved almost optimal learning rates for the proposed model.
Provided a two-stage error decomposition technique for analysis.
Abstract
In this paper, we aim at establishing an approximation theory and a learning theory of distribution regression via a fully connected neural network (FNN). In contrast to the classical regression methods, the input variables of distribution regression are probability measures. Then we often need to perform a second-stage sampling process to approximate the actual information of the distribution. On the other hand, the classical neural network structure requires the input variable to be a vector. When the input samples are probability distributions, the traditional deep neural network method cannot be directly used and the difficulty arises for distribution regression. A well-defined neural network structure for distribution inputs is intensively desirable. There is no mathematical model and theoretical analysis on neural network realization of distribution regression. To overcome…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Control Systems and Identification · Fault Detection and Control Systems
