Text Alignment Is An Efficient Unified Model for Massive NLP Tasks
Yuheng Zha, Yichi Yang, Ruichen Li, Zhiting Hu

TL;DR
This paper introduces Text Alignment, a compact and efficient model that unifies multiple NLP tasks by measuring text information alignment, outperforming larger models on diverse datasets and enhancing language model evaluations.
Contribution
Proposes a lightweight text alignment model that effectively handles various NLP tasks, outperforming larger models and serving as an add-on for LLMs.
Findings
Matches or surpasses larger models on 20+ datasets
Improves factual consistency evaluation over GPT-3.5 and GPT-4
Enhances question answering accuracy when integrated with LLMs
Abstract
Large language models (LLMs), typically designed as a function of next-word prediction, have excelled across extensive NLP tasks. Despite the generality, next-word prediction is often not an efficient formulation for many of the tasks, demanding an extreme scale of model parameters (10s or 100s of billions) and sometimes yielding suboptimal performance. In practice, it is often desirable to build more efficient models -- despite being less versatile, they still apply to a substantial subset of problems, delivering on par or even superior performance with much smaller model sizes. In this paper, we propose text alignment as an efficient unified model for a wide range of crucial tasks involving text entailment, similarity, question answering (and answerability), factual consistency, and so forth. Given a pair of texts, the model measures the degree of alignment between their information.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Warmup With Linear Decay · Linear Layer · WordPiece · BERT · Adam · Cosine Annealing
