Time-Stamped Language Model: Teaching Language Models to Understand the   Flow of Events

Hossein Rajaby Faghihi; Parisa Kordjamshidi

arXiv:2104.07635·cs.CL·April 16, 2021

Time-Stamped Language Model: Teaching Language Models to Understand the Flow of Events

Hossein Rajaby Faghihi, Parisa Kordjamshidi

PDF

1 Repo

TL;DR

This paper introduces the Time-Stamped Language Model (TSLM) that enhances language models' ability to understand event sequences in procedural texts by incorporating timestamp encoding, leading to improved performance on related tasks.

Contribution

The paper proposes a novel timestamp encoding method within language models to better capture the flow of events in procedural texts, improving state-of-the-art results.

Findings

01

Achieved a 3.1% increase in F1 score on Propara dataset.

02

Outperformed previous models on location prediction in NPN-Cooking dataset.

03

Demonstrated general effectiveness for procedural text understanding.

Abstract

Tracking entities throughout a procedure described in a text is challenging due to the dynamic nature of the world described in the process. Firstly, we propose to formulate this task as a question answering problem. This enables us to use pre-trained transformer-based language models on other QA benchmarks by adapting those to the procedural text understanding. Secondly, since the transformer-based language models cannot encode the flow of events by themselves, we propose a Time-Stamped Language Model~(TSLM model) to encode event information in LMs architecture by introducing the timestamp encoding. Our model evaluated on the Propara dataset shows improvements on the published state-of-the-art results with a $3.1%$ increase in F1 score. Moreover, our model yields better results on the location prediction task on the NPN-Cooking dataset. This result indicates that our approach is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HLR/TSLM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.