Probing Script Knowledge from Pre-Trained Models
Zijian Jin, Xingyu Zhang, Mo Yu, Lifu Huang

TL;DR
This paper investigates how well pre-trained language models like BERT understand script knowledge related to daily tasks, revealing strengths in temporal ordering but weaknesses in sub-event selection.
Contribution
It introduces three probing tasks to evaluate script knowledge in PLMs and analyzes BERT's capabilities in script induction and sub-event understanding.
Findings
BERT captures stereotypical temporal knowledge of sub-events
BERT poorly encodes inclusive and starting sub-event knowledge
Probing tasks can induce scripts from sub-events
Abstract
Script knowledge is critical for humans to understand the broad daily tasks and routine activities in the world. Recently researchers have explored the large-scale pre-trained language models (PLMs) to perform various script related tasks, such as story generation, temporal ordering of event, future event prediction and so on. However, it's still not well studied in terms of how well the PLMs capture the script knowledge. To answer this question, we design three probing tasks: inclusive sub-event selection, starting sub-event selection and temporal ordering to investigate the capabilities of PLMs with and without fine-tuning. The three probing tasks can be further used to automatically induce a script for each main event given all the possible sub-events. Taking BERT as a case study, by analyzing its performance on script induction as well as each individual probing task, we conclude…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Adam · Multi-Head Attention · Residual Connection · Dense Connections · Attention Dropout · Softmax · Linear Warmup With Linear Decay
