# Learning Neural Sequence-to-Sequence Models from Weak Feedback with   Bipolar Ramp Loss

**Authors:** Laura Jehl, Carolin Lawrence, Stefan Riezler

arXiv: 1907.03748 · 2019-07-10

## TL;DR

This paper introduces bipolar ramp loss objectives for training neural sequence-to-sequence models under weak supervision, effectively discouraging negative outputs and outperforming existing methods on translation and parsing tasks.

## Contribution

The paper adapts ramp loss to neural models with bipolarity, proposing a novel token-level ramp loss that improves weakly supervised sequence modeling.

## Key findings

- Bipolar ramp loss outperforms non-bipolar ramp loss and MRT.
- Token-level ramp loss surpasses sequence-level ramp loss.
- Method improves weakly supervised machine translation and semantic parsing.

## Abstract

In many machine learning scenarios, supervision by gold labels is not available and consequently neural models cannot be trained directly by maximum likelihood estimation (MLE). In a weak supervision scenario, metric-augmented objectives can be employed to assign feedback to model outputs, which can be used to extract a supervision signal for training. We present several objectives for two separate weakly supervised tasks, machine translation and semantic parsing. We show that objectives should actively discourage negative outputs in addition to promoting a surrogate gold structure. This notion of bipolarity is naturally present in ramp loss objectives, which we adapt to neural models. We show that bipolar ramp loss objectives outperform other non-bipolar ramp loss objectives and minimum risk training (MRT) on both weakly supervised tasks, as well as on a supervised machine translation task. Additionally, we introduce a novel token-level ramp loss objective, which is able to outperform even the best sequence-level ramp loss on both weakly supervised tasks.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.03748/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1907.03748/full.md

## References

48 references — full list in the complete paper: https://tomesphere.com/paper/1907.03748/full.md

---
Source: https://tomesphere.com/paper/1907.03748