Does the Objective Matter? Comparing Training Objectives for Pronoun Resolution

Yordan Yordanov; Oana-Maria Camburu; Vid Kocijan; Thomas Lukasiewicz

arXiv:2010.02570·cs.CL·October 7, 2020

TL;DR

This paper compares four training objectives for pronoun resolution with pre-trained language models. Sequence ranking performs best in-domain but is unstable across seeds, while semantic similarity between candidates and the pronoun performs best out-of-domain.

Contribution

The paper provides a fair comparison of four categories of training objectives for pronoun resolution, measuring both their performance and their seed-wise stability.

Findings

01. Sequence ranking performs best in-domain.

02. Semantic similarity performs best out-of-domain.

03. Sequence ranking shows seed-wise instability.
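
Seed-wise instability, as in the third finding, means that the same model and objective give noticeably different accuracy depending on the random seed. The sketch below shows one simple way to quantify that spread; it is an illustration, not the paper's evaluation code. The function name `seed_stability` and both accuracy lists are hypothetical, and the numbers are invented for the example rather than taken from the paper.

```python
from statistics import mean, stdev


def seed_stability(accuracies):
    """Summarise how much accuracy varies across random seeds.

    `accuracies` holds one accuracy value per training run (seed).
    """
    if len(accuracies) < 2:
        raise ValueError("need runs from at least two seeds")
    return {
        "mean": mean(accuracies),
        "std": stdev(accuracies),  # sample standard deviation
        "min": min(accuracies),
        "max": max(accuracies),
        "range": max(accuracies) - min(accuracies),
    }


# Hypothetical per-seed accuracies, NOT results from the paper.
stable_runs = [0.70, 0.71, 0.69, 0.70, 0.70]
unstable_runs = [0.75, 0.52, 0.76, 0.50, 0.74]

stable_summary = seed_stability(stable_runs)
unstable_summary = seed_stability(unstable_runs)

print("stable:  ", stable_summary)
print("unstable:", unstable_summary)
```

Reporting the standard deviation and range next to the mean makes it visible when a high average hides a few failed runs, which a mean alone would not show.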

Abstract

Hard cases of pronoun resolution have been used as a long-standing benchmark for commonsense reasoning. In the recent literature, pre-trained language models have been used to obtain state-of-the-art results on pronoun resolution. Overall, four categories of training and evaluation objectives have been introduced. The variety of training datasets and pre-trained language models used in these works makes it unclear whether the choice of training objective is critical. In this work, we make a fair comparison of the performance and seed-wise stability of four models that represent the four categories of objectives. Our experiments show that the objective of sequence ranking performs the best in-domain, while the objective of semantic similarity between candidates and pronoun performs the best out-of-domain. We also observe a seed-wise instability of the model using sequence ranking, which…

Code & Models

Repositories

YDYordanov/WS-training-objectives (PyTorch, official)
