Unsupervised Distractor Generation via Large Language Model Distilling   and Counterfactual Contrastive Decoding

Fanyi Qu; Hao Sun; Yunfang Wu

arXiv:2406.01306·cs.CL·June 4, 2024

Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive Decoding

Fanyi Qu, Hao Sun, Yunfang Wu

PDF

Open Access 1 Video

TL;DR

This paper introduces an unsupervised distractor generation framework for reading comprehension that uses large language models for knowledge distillation and counterfactual contrastive decoding, outperforming larger models with fewer resources.

Contribution

It presents a novel unsupervised approach combining LLM-based knowledge distillation and counterfactual decoding for distractor generation, reducing reliance on annotated data and large models.

Findings

01

Outperforms GPT-3.5-turbo with fewer parameters

02

Uses dual task training with pseudo distractors and answer info

03

Effective in practical reading comprehension applications

Abstract

Within the context of reading comprehension, the task of Distractor Generation (DG) aims to generate several incorrect options to confuse readers. Traditional supervised methods for DG rely heavily on expensive human-annotated distractor labels. In this paper, we propose an unsupervised DG framework, leveraging Large Language Models (LLMs) as cost-effective annotators to enhance the DG capability of smaller student models. Specially, to perform knowledge distilling, we propose a dual task training strategy that integrates pseudo distractors from LLMs and the original answer in-formation as the objective targets with a two-stage training process. Moreover, we devise a counterfactual contrastive decoding mechanism for increasing the distracting capability of the DG model. Experiments show that our unsupervised generation method with Bart-base greatly surpasses GPT-3.5-turbo performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive Decoding· underline

Taxonomy

TopicsAdvanced Text Analysis Techniques · Topic Modeling · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Linear Layer · Adam · Cosine Annealing · Attention Is All You Need · Residual Connection · Multi-Head Attention · Dropout · Dense Connections