A scaled gradient projection method for Bayesian learning in dynamical   systems

Silvia Bonettini; Alessandro Chiuso; Marco Prato

arXiv:1406.6603·math.NA·June 9, 2015

A scaled gradient projection method for Bayesian learning in dynamical systems

Silvia Bonettini, Alessandro Chiuso, Marco Prato

PDF

TL;DR

This paper introduces a scaled gradient projection method for Bayesian system identification, efficiently solving nonconvex optimization problems to select optimal model classes with high accuracy and computational speed.

Contribution

It proposes a novel scaled gradient projection algorithm with a tailored scaling matrix and step size for Bayesian model selection in dynamical systems.

Findings

01

Method achieves solutions in a fraction of a second.

02

Results are comparable to state-of-the-art approaches.

03

Algorithm is adaptable to various machine learning and signal processing problems.

Abstract

A crucial task in system identification problems is the selection of the most appropriate model class, and is classically addressed resorting to cross-validation or using asymptotic arguments. As recently suggested in the literature, this can be addressed in a Bayesian framework, where model complexity is regulated by few hyperparameters, which can be estimated via marginal likelihood maximization. It is thus of primary importance to design effective optimization methods to solve the corresponding optimization problem. If the unknown impulse response is modeled as a Gaussian process with a suitable kernel, the maximization of the marginal likelihood leads to a challenging nonconvex optimization problem, which requires a stable and effective solution strategy. In this paper we address this problem by means of a scaled gradient projection algorithm, in which the scaling matrix and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGaussian Process