Probing Script Knowledge from Pre-Trained Models

Zijian Jin; Xingyu Zhang; Mo Yu; Lifu Huang

arXiv:2204.10176·cs.CL·April 22, 2022

Probing Script Knowledge from Pre-Trained Models

Zijian Jin, Xingyu Zhang, Mo Yu, Lifu Huang

PDF

Open Access

TL;DR

This paper investigates how well pre-trained language models like BERT understand script knowledge related to daily tasks, revealing strengths in temporal ordering but weaknesses in sub-event selection.

Contribution

It introduces three probing tasks to evaluate script knowledge in PLMs and analyzes BERT's capabilities in script induction and sub-event understanding.

Findings

01

BERT captures stereotypical temporal knowledge of sub-events

02

BERT poorly encodes inclusive and starting sub-event knowledge

03

Probing tasks can induce scripts from sub-events

Abstract

Script knowledge is critical for humans to understand the broad daily tasks and routine activities in the world. Recently researchers have explored the large-scale pre-trained language models (PLMs) to perform various script related tasks, such as story generation, temporal ordering of event, future event prediction and so on. However, it's still not well studied in terms of how well the PLMs capture the script knowledge. To answer this question, we design three probing tasks: inclusive sub-event selection, starting sub-event selection and temporal ordering to investigate the capabilities of PLMs with and without fine-tuning. The three probing tasks can be further used to automatically induce a script for each main event given all the possible sub-events. Taking BERT as a case study, by analyzing its performance on script induction as well as each individual probing task, we conclude…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Adam · Multi-Head Attention · Residual Connection · Dense Connections · Attention Dropout · Softmax · Linear Warmup With Linear Decay