A scaled gradient projection method for Bayesian learning in dynamical systems
Silvia Bonettini, Alessandro Chiuso, Marco Prato

TL;DR
This paper introduces a scaled gradient projection method for Bayesian system identification, efficiently solving nonconvex optimization problems to select optimal model classes with high accuracy and computational speed.
Contribution
It proposes a novel scaled gradient projection algorithm with a tailored scaling matrix and step size for Bayesian model selection in dynamical systems.
Findings
Method achieves solutions in a fraction of a second.
Results are comparable to state-of-the-art approaches.
Algorithm is adaptable to various machine learning and signal processing problems.
Abstract
A crucial task in system identification problems is the selection of the most appropriate model class, and is classically addressed resorting to cross-validation or using asymptotic arguments. As recently suggested in the literature, this can be addressed in a Bayesian framework, where model complexity is regulated by few hyperparameters, which can be estimated via marginal likelihood maximization. It is thus of primary importance to design effective optimization methods to solve the corresponding optimization problem. If the unknown impulse response is modeled as a Gaussian process with a suitable kernel, the maximization of the marginal likelihood leads to a challenging nonconvex optimization problem, which requires a stable and effective solution strategy. In this paper we address this problem by means of a scaled gradient projection algorithm, in which the scaling matrix and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGaussian Process
