Adaptive Error-Bounded Hierarchical Matrices for Efficient Neural Network Compression
John Mango, Ronald Katende

TL;DR
This paper presents a dynamic, error-bounded hierarchical matrix compression technique for Physics-Informed Neural Networks that improves efficiency, accuracy, and real-time inference capabilities in large-scale scientific models.
Contribution
It introduces an adaptive hierarchical matrix method that reduces computational costs while maintaining model accuracy, outperforming traditional compression techniques.
Findings
Outperforms SVD, pruning, and quantization in accuracy.
Reduces computational complexity and memory usage.
Enhances inference speed for real-time applications.
Abstract
This paper introduces a dynamic, error-bounded hierarchical matrix (H-matrix) compression method tailored for Physics-Informed Neural Networks (PINNs). The proposed approach reduces the computational complexity and memory demands of large-scale physics-based models while preserving the essential properties of the Neural Tangent Kernel (NTK). By adaptively refining hierarchical matrix approximations based on local error estimates, our method ensures efficient training and robust model performance. Empirical results demonstrate that this technique outperforms traditional compression methods, such as Singular Value Decomposition (SVD), pruning, and quantization, by maintaining high accuracy and improving generalization capabilities. Additionally, the dynamic H-matrix method enhances inference speed, making it suitable for real-time applications. This approach offers a scalable and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Blind Source Separation Techniques
