Retrieval Enhanced Model for Commonsense Generation

Han Wang; Yang Liu; Chenguang Zhu; Linjun Shou; Ming Gong; Yichong Xu,; Michael Zeng

arXiv:2105.11174·cs.CL·May 25, 2021

Retrieval Enhanced Model for Commonsense Generation

Han Wang, Yang Liu, Chenguang Zhu, Linjun Shou, Ming Gong, Yichong Xu,, Michael Zeng

PDF

Open Access 1 Repo

TL;DR

This paper introduces a retrieval-augmented framework for commonsense generation that improves performance by retrieving prototype sentences and using a trainable retriever, setting new state-of-the-art results on the CommonGen benchmark.

Contribution

It proposes a novel retrieval-based approach to enhance both pre-training and fine-tuning in commonsense generation tasks.

Findings

01

Achieves new state-of-the-art results on the CommonGen benchmark.

02

Retrieval-augmented method improves reasoning and generalization.

03

Utilizing prototype sentences boosts generation quality.

Abstract

Commonsense generation is a challenging task of generating a plausible sentence describing an everyday scenario using provided concepts. Its requirement of reasoning over commonsense knowledge and compositional generalization ability even puzzles strong pre-trained language generation models. We propose a novel framework using retrieval methods to enhance both the pre-training and fine-tuning for commonsense generation. We retrieve prototype sentence candidates by concept matching and use them as auxiliary input. For fine-tuning, we further boost its performance with a trainable sentence retriever. We demonstrate experimentally on the large-scale CommonGen benchmark that our approach achieves new state-of-the-art results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HanNight/RE-T5
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications