Unified BERT for Few-shot Natural Language Understanding

Junyu Lu; Ping Yang; Ruyi Gan; Jing Yang; Jiaxing Zhang

arXiv:2206.12094·cs.CL·August 16, 2022·1 cites

Unified BERT for Few-shot Natural Language Understanding

Junyu Lu, Ping Yang, Ruyi Gan, Jing Yang, Jiaxing Zhang

PDF

Open Access 2 Models

TL;DR

UBERT is a unified BERT-based model designed for diverse NLU tasks, leveraging a biaffine network to unify training and improve semantic understanding across multiple tasks, especially in few-shot settings.

Contribution

The paper introduces UBERT, a novel unified model that encodes various NLU tasks into a common framework using biaffine scoring, enhancing multi-task learning and few-shot performance.

Findings

01

Won first place in the 2022 AIWIN competition.

02

Unified approach effectively handles multiple NLU tasks.

03

Improved performance in few-shot multi-task scenarios.

Abstract

Even as pre-trained language models share a semantic encoder, natural language understanding suffers from a diversity of output schemas. In this paper, we propose UBERT, a unified bidirectional language understanding model based on BERT framework, which can universally model the training objects of different NLU tasks through a biaffine network. Specifically, UBERT encodes prior knowledge from various aspects, uniformly constructing learning representations across multiple NLU tasks, which is conducive to enhancing the ability to capture common semantic understanding. By using the biaffine to model scores pair of the start and end position of the original text, various classification and extraction structures can be converted into a universal, span-decoding approach. Experiments show that UBERT wins the first price in the 2022 AIWIN - World Artificial Intelligence Innovation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Residual Connection · Adam · Attention Dropout · Layer Normalization · Linear Warmup With Linear Decay