GenSF: Simultaneous Adaptation of Generative Pre-trained Models and Slot   Filling

Shikib Mehri; Maxine Eskenazi

arXiv:2106.07055·cs.CL·June 15, 2021·1 cites

GenSF: Simultaneous Adaptation of Generative Pre-trained Models and Slot Filling

Shikib Mehri, Maxine Eskenazi

PDF

Open Access 1 Repo

TL;DR

GenSF introduces a novel method for slot filling that simultaneously adapts pre-trained models and reformulates downstream tasks, achieving state-of-the-art results especially in few-shot and zero-shot scenarios.

Contribution

It proposes a scalable approach that aligns pre-trained models with downstream tasks by joint adaptation, avoiding task-specific pre-training objectives.

Findings

01

9 F1 score improvement in zero-shot slot filling

02

State-of-the-art results on two datasets

03

Strong gains in few-shot and zero-shot settings

Abstract

In transfer learning, it is imperative to achieve strong alignment between a pre-trained model and a downstream task. Prior work has done this by proposing task-specific pre-training objectives, which sacrifices the inherent scalability of the transfer learning paradigm. We instead achieve strong alignment by simultaneously modifying both the pre-trained model and the formulation of the downstream task, which is more efficient and preserves the scalability of transfer learning. We present GenSF (Generative Slot Filling), which leverages a generative pre-trained open-domain dialog model for slot filling. GenSF (1) adapts the pre-trained model by incorporating inductive biases about the task and (2) adapts the downstream task by reformulating slot filling to better leverage the pre-trained model's capabilities. GenSF achieves state-of-the-art results on two slot filling datasets with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shikib/generative_slot_filling
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications