An Application of Pseudo-Log-Likelihoods to Natural Language Scoring
Darren Abramson, Ali Emami

TL;DR
This paper demonstrates that a hyperparameter-free zero-shot method using pseudo-log-likelihoods can outperform larger models on common sense reasoning tasks, showing robustness and state-of-the-art results without fine-tuning.
Contribution
It introduces a zero-shot evaluation technique based on pseudo-log-likelihoods for masked language models, achieving superior results on reasoning tasks compared to fine-tuned models.
Findings
Smaller models outperform larger models on TimeDial dataset.
Zero-shot method achieves state-of-the-art results in common sense reasoning.
Model performance remains robust under adversarial conditions.
Abstract
Language models built using semi-supervised machine learning on large corpora of natural language have very quickly enveloped the fields of natural language generation and understanding. In this paper we apply a zero-shot approach independently developed by a number of researchers now gaining recognition as a significant alternative to fine-tuning for evaluation on common sense tasks. A language model with relatively few parameters and training steps compared to a more recent language model (T5) can outperform it on a recent large data set (TimeDial), while displaying robustness in its performance across a similar class of language tasks. Surprisingly, this result is achieved by using a hyperparameter-free zero-shot method with the smaller model, compared to fine-tuning to the larger model. We argue that robustness of the smaller model ought to be understood in terms of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · LAMB · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Weight Decay · Linear Warmup With Linear Decay
