Training Deep Networks with Structured Layers by Matrix Backpropagation
Catalin Ionescu, Orestis Vantzos, Cristian Sminchisescu

TL;DR
This paper introduces a novel matrix backpropagation method enabling the integration of global structured matrix computations into deep neural networks, demonstrated through improved visual segmentation results.
Contribution
It develops a mathematical framework for backpropagation through structured matrix layers, allowing end-to-end training of deep networks with global computation modules.
Findings
Deep networks with structured matrix layers outperform traditional models.
Matrix backpropagation enables efficient training of global computation layers.
Experimental results on segmentation benchmarks show improved accuracy.
Abstract
Deep neural network architectures have recently produced excellent results in a variety of areas in artificial intelligence and visual recognition, well surpassing traditional shallow architectures trained using hand-designed features. The power of deep networks stems both from their ability to perform local computations followed by pointwise non-linearities over increasingly larger receptive fields, and from the simplicity and scalability of the gradient-descent training procedure based on backpropagation. An open problem is the inclusion of layers that perform global, structured matrix computations like segmentation (e.g. normalized cuts) or higher-order pooling (e.g. log-tangent space metrics defined over the manifold of symmetric positive definite matrices) while preserving the validity and efficiency of an end-to-end deep training framework. In this paper we propose a sound…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Sparse and Compressive Sensing Techniques · Domain Adaptation and Few-Shot Learning
