Learning Differentiable Surrogate Losses for Structured Prediction

Junjie Yang; Matthieu Labeau; Florence d'Alch\'e-Buc

arXiv:2411.11682·stat.ML·November 19, 2024

Learning Differentiable Surrogate Losses for Structured Prediction

Junjie Yang, Matthieu Labeau, Florence d'Alch\'e-Buc

PDF

Open Access

TL;DR

This paper introduces a neural network-based framework for learning differentiable surrogate loss functions for structured prediction, enabling improved prediction of complex output structures through contrastive learning and gradient-based decoding.

Contribution

It proposes a novel method to learn structured loss functions directly from data using contrastive learning, bypassing the need for domain-specific loss design.

Findings

01

Achieves comparable or better performance than kernel-based methods on graph prediction tasks.

02

Enables prediction of complex structures via gradient descent decoding.

03

Provides a flexible framework for structured prediction with neural networks.

Abstract

Structured prediction involves learning to predict complex structures rather than simple scalar values. The main challenge arises from the non-Euclidean nature of the output space, which generally requires relaxing the problem formulation. Surrogate methods build on kernel-induced losses or more generally, loss functions admitting an Implicit Loss Embedding, and convert the original problem into a regression task followed by a decoding step. However, designing effective losses for objects with complex structures presents significant challenges and often requires domain-specific expertise. In this work, we introduce a novel framework in which a structured loss function, parameterized by neural networks, is learned directly from output training data through Contrastive Learning, prior to addressing the supervised surrogate regression problem. As a result, the differentiable loss not only…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification

MethodsContrastive Learning