Exploring Biologically Inspired Mechanisms of Adversarial Robustness
Konstantin Holzhausen, Mia Merlid, H{\aa}kon Olav Torvik and, Anders Malthe-S{\o}renssen, Mikkel Elle Lepper{\o}d

TL;DR
This paper investigates how biological neural mechanisms can inform the development of more robust artificial neural networks by analyzing spectral properties, representation smoothness, and learning dynamics.
Contribution
It demonstrates that unsupervised local learning models with winner-takes-all dynamics can learn power law spectral representations linked to robustness.
Findings
Power law spectral representations are learned by local unsupervised models.
Spectral and weight regularization influence robustness and expressivity.
Biological insights can guide the design of more stable artificial neural networks.
Abstract
Backpropagation-optimized artificial neural networks, while precise, lack robustness, leading to unforeseen behaviors that affect their safety. Biological neural systems do solve some of these issues already. Unlike artificial models, biological neurons adjust connectivity based on neighboring cell activity. Understanding the biological mechanisms of robustness can pave the way towards building trust worthy and safe systems. Robustness in neural representations is hypothesized to correlate with the smoothness of the encoding manifold. Recent work suggests power law covariance spectra, which were observed studying the primary visual cortex of mice, to be indicative of a balanced trade-off between accuracy and robustness in representations. Here, we show that unsupervised local learning models with winner takes all dynamics learn such power law representations, providing upcoming studies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
