Unified BERT for Few-shot Natural Language Understanding
Junyu Lu, Ping Yang, Ruyi Gan, Jing Yang, Jiaxing Zhang

TL;DR
UBERT is a unified BERT-based model designed for diverse NLU tasks, leveraging a biaffine network to unify training and improve semantic understanding across multiple tasks, especially in few-shot settings.
Contribution
The paper introduces UBERT, a novel unified model that encodes various NLU tasks into a common framework using biaffine scoring, enhancing multi-task learning and few-shot performance.
Findings
Won first place in the 2022 AIWIN competition.
Unified approach effectively handles multiple NLU tasks.
Improved performance in few-shot multi-task scenarios.
Abstract
Even as pre-trained language models share a semantic encoder, natural language understanding suffers from a diversity of output schemas. In this paper, we propose UBERT, a unified bidirectional language understanding model based on BERT framework, which can universally model the training objects of different NLU tasks through a biaffine network. Specifically, UBERT encodes prior knowledge from various aspects, uniformly constructing learning representations across multiple NLU tasks, which is conducive to enhancing the ability to capture common semantic understanding. By using the biaffine to model scores pair of the start and end position of the original text, various classification and extraction structures can be converted into a universal, span-decoding approach. Experiments show that UBERT wins the first price in the 2022 AIWIN - World Artificial Intelligence Innovation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Residual Connection · Adam · Attention Dropout · Layer Normalization · Linear Warmup With Linear Decay
