Fine-grained Generalization Analysis of Structured Output Prediction

Waleed Mustafa; Yunwen Lei; Antoine Ledent; Marius Kloft

arXiv:2106.00115·cs.LG·June 3, 2021

Fine-grained Generalization Analysis of Structured Output Prediction

Waleed Mustafa, Yunwen Lei, Antoine Ledent, Marius Kloft

PDF

TL;DR

This paper improves theoretical understanding of structured output prediction by providing new generalization bounds with logarithmic label set dependency and stability-based bounds independent of label size, applicable to large-scale problems.

Contribution

It introduces novel high-probability and expectation-based generalization bounds for SOPPs with large label sets, extending to weakly dependent data.

Findings

01

Logarithmic dependency on label set size in generalization bounds.

02

Stability-based bounds independent of label set size.

03

Extension to weakly dependent data scenarios.

Abstract

In machine learning we often encounter structured output prediction problems (SOPPs), i.e. problems where the output space admits a rich internal structure. Application domains where SOPPs naturally occur include natural language processing, speech recognition, and computer vision. Typical SOPPs have an extremely large label set, which grows exponentially as a function of the size of the output. Existing generalization analysis implies generalization bounds with at least a square-root dependency on the cardinality $d$ of the label set, which can be vacuous in practice. In this paper, we significantly improve the state of the art by developing novel high-probability bounds with a logarithmic dependency on $d$ . Moreover, we leverage the lens of algorithmic stability to develop generalization bounds in expectation without any dependency on $d$ . Our results therefore build a solid…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.