An Application of Pseudo-Log-Likelihoods to Natural Language Scoring

Darren Abramson; Ali Emami

arXiv:2201.09377·cs.CL·January 25, 2022·1 cites

An Application of Pseudo-Log-Likelihoods to Natural Language Scoring

Darren Abramson, Ali Emami

PDF

Open Access

TL;DR

This paper demonstrates that a hyperparameter-free zero-shot method using pseudo-log-likelihoods can outperform larger models on common sense reasoning tasks, showing robustness and state-of-the-art results without fine-tuning.

Contribution

It introduces a zero-shot evaluation technique based on pseudo-log-likelihoods for masked language models, achieving superior results on reasoning tasks compared to fine-tuned models.

Findings

01

Smaller models outperform larger models on TimeDial dataset.

02

Zero-shot method achieves state-of-the-art results in common sense reasoning.

03

Model performance remains robust under adversarial conditions.

Abstract

Language models built using semi-supervised machine learning on large corpora of natural language have very quickly enveloped the fields of natural language generation and understanding. In this paper we apply a zero-shot approach independently developed by a number of researchers now gaining recognition as a significant alternative to fine-tuning for evaluation on common sense tasks. A language model with relatively few parameters and training steps compared to a more recent language model (T5) can outperform it on a recent large data set (TimeDial), while displaying robustness in its performance across a similar class of language tasks. Surprisingly, this result is achieved by using a hyperparameter-free zero-shot method with the smaller model, compared to fine-tuning to the larger model. We argue that robustness of the smaller model ought to be understood in terms of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · LAMB · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Weight Decay · Linear Warmup With Linear Decay