Faster Rates for training Max-Margin Markov Networks

Xinhua Zhang (1); Ankan Saha (2); S.V.N. Vishwanathan (1)((1) Purdue; University; (2) University of Chicago)

arXiv:1003.1354·cs.LG·March 9, 2010·1 cites

Faster Rates for training Max-Margin Markov Networks

Xinhua Zhang (1), Ankan Saha (2), S.V.N. Vishwanathan (1)((1) Purdue, University, (2) University of Chicago)

PDF

Open Access

TL;DR

This paper introduces a new optimization technique for max-margin Markov networks that achieves faster convergence rates by leveraging Bregman projections, improving efficiency in structured output prediction tasks.

Contribution

The paper proposes a novel excessive gap reduction method based on Bregman projections that converges faster and maintains graphical model factorization, unlike previous approaches.

Findings

01

Converges in O(1/√ε) iterations, faster than previous methods.

02

Preserves graphical model factorization for tractable computation.

03

Easily kernelized for enhanced flexibility.

Abstract

Structured output prediction is an important machine learning problem both in theory and practice, and the max-margin Markov network (\mcn) is an effective approach. All state-of-the-art algorithms for optimizing \mcn\ objectives take at least $O (1/ ϵ)$ number of iterations to find an $ϵ$ accurate solution. Recent results in structured optimization suggest that faster rates are possible by exploiting the structure of the objective function. Towards this end \citet{Nesterov05} proposed an excessive gap reduction technique based on Euclidean projections which converges in $O (1/ ϵ)$ iterations on strongly convex functions. Unfortunately when applied to \mcn s, this approach does not admit graphical model factorization which, as in many existing algorithms, is crucial for keeping the cost per iteration tractable. In this paper, we present a new excessive gap…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · Machine Learning and Algorithms · Face and Expression Recognition