Unsupervised Pronoun Resolution via Masked Noun-Phrase Prediction

Ming Shen; Pratyay Banerjee; Chitta Baral

arXiv:2105.12392·cs.CL·May 31, 2021

Unsupervised Pronoun Resolution via Masked Noun-Phrase Prediction

Ming Shen, Pratyay Banerjee, Chitta Baral

PDF

Open Access

TL;DR

This paper introduces Masked Noun-Phrase Prediction (MNPP), an unsupervised pre-training approach that significantly improves pronoun resolution performance across multiple datasets, outperforming previous methods and baselines.

Contribution

It presents a novel unsupervised pre-training strategy for pronoun resolution that outperforms existing methods and enhances few-shot learning capabilities.

Findings

01

Outperforms all previous unsupervised methods on pronoun resolution datasets

02

Achieves higher AUC scores after fine-tuning on WinoGrande splits

03

Significantly outperforms RoBERTa-large baseline in few-shot settings

Abstract

In this work, we propose Masked Noun-Phrase Prediction (MNPP), a pre-training strategy to tackle pronoun resolution in a fully unsupervised setting. Firstly, We evaluate our pre-trained model on various pronoun resolution datasets without any finetuning. Our method outperforms all previous unsupervised methods on all datasets by large margins. Secondly, we proceed to a few-shot setting where we finetune our pre-trained model on WinoGrande-S and XS separately. Our method outperforms RoBERTa-large baseline with large margins, meanwhile, achieving a higher AUC score after further finetuning on the remaining three official splits of WinoGrande.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification