Convolutional Neural Networks Analyzed via Inverse Problem Theory and   Sparse Representations

Cem Tarhan; Gozde Bozdagi Akar

arXiv:1807.07998·cs.LG·October 26, 2018

Convolutional Neural Networks Analyzed via Inverse Problem Theory and Sparse Representations

Cem Tarhan, Gozde Bozdagi Akar

PDF

TL;DR

This paper provides a theoretical analysis of CNNs for inverse imaging problems, demonstrating how they learn optimal solutions as filters and the importance of mutual coherence for convergence.

Contribution

It offers a mathematical validation of CNN training dynamics for inverse problems, linking residual learning and skip connections to mutual coherence and convergence.

Findings

01

CNN filters solve inverse problems during training

02

Mutual coherence is crucial for CNN convergence

03

Residual learning and skip connections enhance coherence and performance

Abstract

Inverse problems in imaging such as denoising, deblurring, superresolution (SR) have been addressed for many decades. In recent years, convolutional neural networks (CNNs) have been widely used for many inverse problem areas. Although their indisputable success, CNNs are not mathematically validated as to how and what they learn. In this paper, we prove that during training, CNN elements solve for inverse problems which are optimum solutions stored as CNN neuron filters. We discuss the necessity of mutual coherence between CNN layer elements in order for a network to converge to the optimum solution. We prove that required mutual coherence can be provided by the usage of residual learning and skip connections. We have set rules over training sets and depth of networks for better convergence, i.e. performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.