Error Bounds for a Matrix-Vector Product Approximation with Deep ReLU Neural Networks
Tilahun M. Getu

TL;DR
This paper develops theoretical error bounds for approximating matrix-vector products using deep ReLU neural networks and demonstrates their practical effectiveness, with implications for various scientific fields.
Contribution
The paper provides the first rigorous error bounds for deep ReLU networks approximating matrix-vector products, guiding training and applications in multiple domains.
Findings
Derived error bounds in Lebesgue and Sobolev norms.
Successfully trained deep ReLU networks matching theoretical predictions.
Applicable to teacher-student training paradigms in AI and ML.
Abstract
Among the several paradigms of artificial intelligence (AI) or machine learning (ML), a remarkably successful paradigm is deep learning. Deep learning's phenomenal success has been hoped to be interpreted via fundamental research on the theory of deep learning. Accordingly, applied research on deep learning has spurred the theory of deep learning-oriented depth and breadth of developments. Inspired by such developments, we pose these fundamental questions: can we accurately approximate an arbitrary matrix-vector product using deep rectified linear unit (ReLU) feedforward neural networks (FNNs)? If so, can we bound the resulting approximation error? In light of these questions, we derive error bounds in Lebesgue and Sobolev norms that comprise our developed deep approximation theory. Guided by this theory, we have successfully trained deep ReLU FNNs whose test results justify our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
