Learning Differentiable Surrogate Losses for Structured Prediction
Junjie Yang, Matthieu Labeau, Florence d'Alch\'e-Buc

TL;DR
This paper introduces a neural network-based framework for learning differentiable surrogate loss functions for structured prediction, enabling improved prediction of complex output structures through contrastive learning and gradient-based decoding.
Contribution
It proposes a novel method to learn structured loss functions directly from data using contrastive learning, bypassing the need for domain-specific loss design.
Findings
Achieves comparable or better performance than kernel-based methods on graph prediction tasks.
Enables prediction of complex structures via gradient descent decoding.
Provides a flexible framework for structured prediction with neural networks.
Abstract
Structured prediction involves learning to predict complex structures rather than simple scalar values. The main challenge arises from the non-Euclidean nature of the output space, which generally requires relaxing the problem formulation. Surrogate methods build on kernel-induced losses or more generally, loss functions admitting an Implicit Loss Embedding, and convert the original problem into a regression task followed by a decoding step. However, designing effective losses for objects with complex structures presents significant challenges and often requires domain-specific expertise. In this work, we introduce a novel framework in which a structured loss function, parameterized by neural networks, is learned directly from output training data through Contrastive Learning, prior to addressing the supervised surrogate regression problem. As a result, the differentiable loss not only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
MethodsContrastive Learning
