Exploring Biologically Inspired Mechanisms of Adversarial Robustness

Konstantin Holzhausen; Mia Merlid; H{\aa}kon Olav Torvik and; Anders Malthe-S{\o}renssen; Mikkel Elle Lepper{\o}d

arXiv:2405.00679·cs.NE·February 4, 2025

Exploring Biologically Inspired Mechanisms of Adversarial Robustness

Konstantin Holzhausen, Mia Merlid, H{\aa}kon Olav Torvik and, Anders Malthe-S{\o}renssen, Mikkel Elle Lepper{\o}d

PDF

TL;DR

This paper investigates how biological neural mechanisms can inform the development of more robust artificial neural networks by analyzing spectral properties, representation smoothness, and learning dynamics.

Contribution

It demonstrates that unsupervised local learning models with winner-takes-all dynamics can learn power law spectral representations linked to robustness.

Findings

01

Power law spectral representations are learned by local unsupervised models.

02

Spectral and weight regularization influence robustness and expressivity.

03

Biological insights can guide the design of more stable artificial neural networks.

Abstract

Backpropagation-optimized artificial neural networks, while precise, lack robustness, leading to unforeseen behaviors that affect their safety. Biological neural systems do solve some of these issues already. Unlike artificial models, biological neurons adjust connectivity based on neighboring cell activity. Understanding the biological mechanisms of robustness can pave the way towards building trust worthy and safe systems. Robustness in neural representations is hypothesized to correlate with the smoothness of the encoding manifold. Recent work suggests power law covariance spectra, which were observed studying the primary visual cortex of mice, to be indicative of a balanced trade-off between accuracy and robustness in representations. Here, we show that unsupervised local learning models with winner takes all dynamics learn such power law representations, providing upcoming studies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.