Analysis and Prediction of NLP Models Via Task Embeddings

Damien Sileo; Marie-Francine Moens

arXiv:2112.05647·cs.CL·December 13, 2021

TL;DR

This paper introduces MetaEval, a collection of 101 NLP tasks, and shows that task embeddings learned on it can be used to analyze relationships among tasks, to predict embeddings for new tasks, and to improve zero-shot performance over a baseline on GLUE tasks.

Contribution

The paper fits a single transformer to all MetaEval tasks jointly, conditioned on learned task embeddings. The embeddings support analysis of the task space, and embeddings predicted from task aspects enable zero-shot inference on new tasks without annotated examples.

Findings

01. Task embeddings reveal meaningful relationships among NLP tasks.

02. Predicted embeddings improve zero-shot performance on GLUE tasks.

03. MetaEval serves as a new benchmark for transfer learning research.

Abstract

Task embeddings are low-dimensional representations that are trained to capture task properties. In this paper, we propose MetaEval, a collection of 101 NLP tasks. We fit a single transformer to all MetaEval tasks jointly while conditioning it on learned embeddings. The resulting task embeddings enable a novel analysis of the space of tasks. We then show that task aspects can be mapped to task embeddings for new tasks without using any annotated examples. Predicted embeddings can modulate the encoder for zero-shot inference and outperform a zero-shot baseline on GLUE tasks. The provided multitask setup can function as a benchmark for future transfer learning research.
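
The two mechanisms named in the abstract, conditioning a shared model on a task embedding and mapping task aspects to an embedding for an unseen task, can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the task names, the two-dimensional embedding values, the binary aspect vectors, conditioning by concatenation, and the similarity-weighted average used as the aspect-to-embedding mapping. The abstract does not specify how the authors implement either step.

```python
from typing import Dict, List

Vector = List[float]

# Hypothetical learned task embeddings (toy values, not from the paper).
TASK_EMBEDDINGS: Dict[str, Vector] = {
    "sentiment": [1.0, 0.0],
    "paraphrase": [0.0, 1.0],
}

# Hypothetical binary task aspects: [is_single_sentence, is_sentence_pair].
TASK_ASPECTS: Dict[str, Vector] = {
    "sentiment": [1.0, 0.0],
    "paraphrase": [0.0, 1.0],
}


def condition(features: Vector, task_embedding: Vector) -> Vector:
    """Condition a shared representation by concatenating the task embedding.

    Concatenation is an assumed stand-in for whatever modulation the
    actual encoder uses.
    """
    return features + task_embedding


def predict_embedding(aspects: Vector) -> Vector:
    """Map aspects to an embedding without any annotated examples.

    Uses a similarity-weighted average over known tasks, where the
    similarity is the dot product between aspect vectors (an assumed
    mapping, chosen only for simplicity).
    """
    weights = {
        name: sum(a * b for a, b in zip(aspects, known))
        for name, known in TASK_ASPECTS.items()
    }
    total = sum(weights.values())
    if total == 0:
        raise ValueError("aspects share nothing with any known task")
    dim = len(next(iter(TASK_EMBEDDINGS.values())))
    return [
        sum(weights[name] * TASK_EMBEDDINGS[name][i] for name in TASK_EMBEDDINGS)
        / total
        for i in range(dim)
    ]


# A new sentence-pair task gets an embedding from its aspects alone,
# and that embedding then conditions the shared representation.
new_task_embedding = predict_embedding([0.0, 1.0])
conditioned = condition([0.5, -0.5], new_task_embedding)
```

In this toy setup, a new task described only by its aspects inherits the embedding of the known tasks it most resembles, which mirrors the zero-shot idea in the abstract at a very small scale.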

Code & Models

Repositories

sileod/metaeval (official)

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Machine Learning in Healthcare · Human Pose and Action Recognition