A Structured Sparse Neural Network and Its Matrix Calculations Algorithm
Seyyed Mostafa Mousavi Janbeh Sarayi, Mansour Nikkhah Bahrami

TL;DR
This paper introduces a novel matrix calculation algorithm for structured sparse neural networks using nonsymmetric tridiagonal matrices, significantly improving computational efficiency for large datasets and deep networks.
Contribution
It develops a direct, efficient algorithm for inverting and computing pseudoinverses of nonsymmetric tridiagonal matrices with off-diagonal sparsity, enhancing neural network training methods.
Findings
Significant reduction in computational costs for large matrices.
Effective handling of underfitting and overfitting through structured sparsity.
Algorithm tested successfully on randomly generated matrices of varying sizes.
Abstract
Gradient descent optimizations and backpropagation are the most common methods for training neural networks, but they are computationally expensive for real time applications, need high memory resources, and are difficult to converge for many networks and large datasets. [Pseudo]inverse models for training neural network have emerged as powerful tools to overcome these issues. In order to effectively implement these methods, structured pruning maybe be applied to produce sparse neural networks. Although sparse neural networks are efficient in memory usage, most of their algorithms use the same fully loaded matrix calculation methods which are not efficient for sparse matrices. Tridiagonal matrices are one of the frequently used candidates for structuring neural networks, but they are not flexible enough to handle underfitting and overfitting problems as well as generalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Matrix Theory and Algorithms · Machine Learning and ELM
MethodsPruning
