Probing Simile Knowledge from Pre-trained Language Models
Weijie Chen, Yongzhu Chang, Rongsheng Zhang, Jiashu Pu, Guandan Chen,, Le Zhang, Yadong Xi, Yijiang Chen, Chang Su

TL;DR
This paper explores the potential of pre-trained language models to understand and generate similes by probing their embedded knowledge through a novel unified framework involving pattern-based masked sentence completion.
Contribution
It introduces a new unified framework for simile interpretation and generation by probing PLMs, utilizing pattern ensemble, search, and a secondary training process to improve prediction quality.
Findings
Effective in simile interpretation and generation tasks
Outperforms baseline models in automatic and human evaluations
Enhances prediction diversity with secondary training
Abstract
Simile interpretation (SI) and simile generation (SG) are challenging tasks for NLP because models require adequate world knowledge to produce predictions. Previous works have employed many hand-crafted resources to bring knowledge-related into models, which is time-consuming and labor-intensive. In recent years, pre-trained language models (PLMs) based approaches have become the de-facto standard in NLP since they learn generic knowledge from a large corpus. The knowledge embedded in PLMs may be useful for SI and SG tasks. Nevertheless, there are few works to explore it. In this paper, we probe simile knowledge from PLMs to solve the SI and SG tasks in the unified framework of simile triple completion for the first time. The backbone of our framework is to construct masked sentences with manual patterns and then predict the candidate words in the masked position. In this framework, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
