Learning latent variable structured prediction models with Gaussian   perturbations

Kevin Bello; Jean Honorio

arXiv:1805.09213·cs.LG·June 25, 2019

Learning latent variable structured prediction models with Gaussian perturbations

Kevin Bello, Jean Honorio

PDF

Open Access

TL;DR

This paper introduces a novel approach for structured prediction models with latent variables using Gaussian perturbations, providing theoretical analysis and practical algorithms that improve learning efficiency and generalization bounds.

Contribution

It extends existing random sampling methods to include latent variables, offering a tighter upper bound on Gibbs decoder distortion and faster evaluation techniques.

Findings

01

The method achieves tighter bounds on model distortion.

02

It demonstrates improved efficiency in learning with latent variables.

03

Experimental results validate the theoretical advantages.

Abstract

The standard margin-based structured prediction commonly uses a maximum loss over all possible structured outputs. The large-margin formulation including latent variables not only results in a non-convex formulation but also increases the search space by a factor of the size of the latent space. Recent work has proposed the use of the maximum loss over random structured outputs sampled independently from some proposal distribution, with theoretical guarantees. We extend this work by including latent variables. We study a new family of loss functions under Gaussian perturbations and analyze the effect of the latent space on the generalization bounds. We show that the non-convexity of learning with latent variables originates naturally, as it relates to a tight upper bound of the Gibbs decoder distortion with respect to the latent space. Finally, we provide a formulation using random…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Methods and Mixture Models · Machine Learning and Algorithms · Gaussian Processes and Bayesian Inference