Text Alignment Is An Efficient Unified Model for Massive NLP Tasks

Yuheng Zha; Yichi Yang; Ruichen Li; Zhiting Hu

arXiv:2307.02729·cs.CL·November 3, 2023

Text Alignment Is An Efficient Unified Model for Massive NLP Tasks

Yuheng Zha, Yichi Yang, Ruichen Li, Zhiting Hu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Text Alignment, a compact and efficient model that unifies multiple NLP tasks by measuring text information alignment, outperforming larger models on diverse datasets and enhancing language model evaluations.

Contribution

Proposes a lightweight text alignment model that effectively handles various NLP tasks, outperforming larger models and serving as an add-on for LLMs.

Findings

01

Matches or surpasses larger models on 20+ datasets

02

Improves factual consistency evaluation over GPT-3.5 and GPT-4

03

Enhances question answering accuracy when integrated with LLMs

Abstract

Large language models (LLMs), typically designed as a function of next-word prediction, have excelled across extensive NLP tasks. Despite the generality, next-word prediction is often not an efficient formulation for many of the tasks, demanding an extreme scale of model parameters (10s or 100s of billions) and sometimes yielding suboptimal performance. In practice, it is often desirable to build more efficient models -- despite being less versatile, they still apply to a substantial subset of problems, delivering on par or even superior performance with much smaller model sizes. In this paper, we propose text alignment as an efficient unified model for a wide range of crucial tasks involving text entailment, similarity, question answering (and answerability), factual consistency, and so forth. Given a pair of texts, the model measures the degree of alignment between their information.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuh-zha/align
pytorch

Videos

Text Alignment Is An Efficient Unified Model for Massive NLP Tasks· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Warmup With Linear Decay · Linear Layer · WordPiece · BERT · Adam · Cosine Annealing