A Unified Perspective on Optimization in Machine Learning and Neuroscience: From Gradient Descent to Neural Adaptation
Jes\'us Garc\'ia Fern\'andez, Nasir Ahmad, Marcel van Gerven

TL;DR
This paper offers a unified view of optimization techniques in AI and neuroscience, highlighting how zeroth-order methods and biological learning share core principles, with implications for scalable AI and understanding brain function.
Contribution
It bridges gradient-based and zeroth-order optimization methods, connecting machine learning algorithms with biological learning mechanisms through a unified framework.
Findings
Modern ZO methods can approximate gradients effectively.
Biological learning parallels ZO principles of exploration and feedback.
Zeroth-order paradigm offers insights into brain learning and neuromorphic hardware.
Abstract
Iterative optimization is central to modern artificial intelligence (AI) and provides a crucial framework for understanding adaptive systems. This review provides a unified perspective on this subject, bridging classic theory with neural network training and biological learning. Although gradient-based methods, powered by the efficient but biologically implausible backpropagation (BP), dominate machine learning, their computational demands can hinder scalability in high-dimensional settings. In contrast, derivative-free or zeroth-order (ZO) optimization feature computationally lighter approaches that rely only on function evaluations and randomness. While generally less sample efficient, recent breakthroughs demonstrate that modern ZO methods can effectively approximate gradients and achieve performance competitive with BP in neural network models. This ZO paradigm is also particularly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Neural Networks and Reservoir Computing · Stochastic Gradient Optimization Techniques
