Embarrassingly Simple Performance Prediction for Abductive Natural   Language Inference

Em\=ils Kadi\c{k}is; Vaibhav Srivastav; Roman Klinger

arXiv:2202.10408·cs.CL·July 12, 2022

Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference

Em\=ils Kadi\c{k}is, Vaibhav Srivastav, Roman Klinger

PDF

Open Access 1 Repo

TL;DR

This paper introduces a simple, fast method to predict the performance of models on abductive natural language inference tasks by using cosine similarity of sentence embeddings, saving significant time in model selection.

Contribution

It proposes a novel performance prediction approach based on embedding similarity that correlates well with actual model accuracy, eliminating the need for extensive fine-tuning.

Findings

01

Cosine similarity correlates with classifier accuracy (Pearson r=0.65).

02

Performance prediction is significantly faster (less than a minute).

03

Method enables efficient model selection for abductive NLI.

Abstract

The task of abductive natural language inference (\alpha{}nli), to decide which hypothesis is the more likely explanation for a set of observations, is a particularly difficult type of NLI. Instead of just determining a causal relationship, it requires common sense to also evaluate how reasonable an explanation is. All recent competitive systems build on top of contextualized representations and make use of transformer architectures for learning an NLI model. When somebody is faced with a particular NLI task, they need to select the best model that is available. This is a time-consuming and resource-intense endeavour. To solve this practical problem, we propose a simple method for predicting the performance without actually fine-tuning the model. We do this by testing how well the pre-trained models perform on the \alpha{}nli task when just comparing sentence embeddings with cosine…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Vaibhavs10/anli-performance-prediction
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Label Smoothing · Dropout · Byte Pair Encoding · Adam · Dense Connections · Softmax