Transformed Low-Rank Parameterization Can Help Robust Generalization for Tensor Neural Networks
Andong Wang, Chao Li, Mingyuan Bai, Zhong Jin, Guoxu Zhou, Qibin Zhao

TL;DR
This paper investigates how transformed low-rank parameterization influences the generalization ability of tensor neural networks, providing theoretical bounds and practical insights into robustness and adversarial training effects.
Contribution
It is the first to derive generalization bounds for t-NNs with transformed low-rank weights and explains how adversarial training promotes low-rank regularization.
Findings
Transformed low-rank parameterization leads to sharper adversarial generalization bounds.
Adversarial training with gradient flow implicitly regularizes t-NNs towards low-rank weights.
The analysis applies to both exact and approximate transformed low-rank weights.
Abstract
Achieving efficient and robust multi-channel data learning is a challenging task in data science. By exploiting low-rankness in the transformed domain, i.e., transformed low-rankness, tensor Singular Value Decomposition (t-SVD) has achieved extensive success in multi-channel data representation and has recently been extended to function representation such as Neural Networks with t-product layers (t-NNs). However, it still remains unclear how t-SVD theoretically affects the learning behavior of t-NNs. This paper is the first to answer this question by deriving the upper bounds of the generalization error of both standard and adversarially trained t-NNs. It reveals that the t-NNs compressed by exact transformed low-rank parameterization can achieve a sharper adversarial generalization bound. In practice, although t-NNs rarely have exactly transformed low-rank weights, our analysis…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTensor decomposition and applications · Advanced Neural Network Applications · Computational Physics and Python Applications
