Probing Simile Knowledge from Pre-trained Language Models

Weijie Chen; Yongzhu Chang; Rongsheng Zhang; Jiashu Pu; Guandan Chen,; Le Zhang; Yadong Xi; Yijiang Chen; Chang Su

arXiv:2204.12807·cs.CL·April 28, 2022

Probing Simile Knowledge from Pre-trained Language Models

Weijie Chen, Yongzhu Chang, Rongsheng Zhang, Jiashu Pu, Guandan Chen,, Le Zhang, Yadong Xi, Yijiang Chen, Chang Su

PDF

Open Access 1 Repo

TL;DR

This paper explores the potential of pre-trained language models to understand and generate similes by probing their embedded knowledge through a novel unified framework involving pattern-based masked sentence completion.

Contribution

It introduces a new unified framework for simile interpretation and generation by probing PLMs, utilizing pattern ensemble, search, and a secondary training process to improve prediction quality.

Findings

01

Effective in simile interpretation and generation tasks

02

Outperforms baseline models in automatic and human evaluations

03

Enhances prediction diversity with secondary training

Abstract

Simile interpretation (SI) and simile generation (SG) are challenging tasks for NLP because models require adequate world knowledge to produce predictions. Previous works have employed many hand-crafted resources to bring knowledge-related into models, which is time-consuming and labor-intensive. In recent years, pre-trained language models (PLMs) based approaches have become the de-facto standard in NLP since they learn generic knowledge from a large corpus. The knowledge embedded in PLMs may be useful for SI and SG tasks. Nevertheless, there are few works to explore it. In this paper, we probe simile knowledge from PLMs to solve the SI and SG tasks in the unified framework of simile triple completion for the first time. The backbone of our framework is to construct masked sentences with manual patterns and then predict the candidate words in the masked position. In this framework, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nairoj/Probing-Simile-from-PLM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification