Towards Effective Time-Aware Language Representation: Exploring Enhanced   Temporal Understanding in Language Models

Jiexin Wang; Adam Jatowt; Yi Cai

arXiv:2406.01863·cs.CL·March 6, 2025

Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models

Jiexin Wang, Adam Jatowt, Yi Cai

PDF

Open Access

TL;DR

BiTimeBERT 2.0 is a novel time-aware language model trained on news data with innovative objectives, significantly improving temporal understanding and reasoning in NLP tasks.

Contribution

Introduces BiTimeBERT 2.0 with three new pre-training objectives and an efficient corpus preprocessing strategy for enhanced temporal language modeling.

Findings

01

Significant performance improvements on time-related NLP tasks

02

Effective modeling of temporal contexts and relations

03

Reduced training time by nearly 53%

Abstract

In the evolving field of Natural Language Processing (NLP), understanding the temporal context of text is increasingly critical for applications requiring advanced temporal reasoning. Traditional pre-trained language models like BERT, which rely on synchronic document collections such as BookCorpus and Wikipedia, often fall short in effectively capturing and leveraging temporal information. To address this limitation, we introduce BiTimeBERT 2.0, a novel time-aware language model pre-trained on a temporal news article collection. BiTimeBERT 2.0 incorporates temporal information through three innovative pre-training objectives: Extended Time-Aware Masked Language Modeling (ETAMLM), Document Dating (DD), and Time-Sensitive Entity Replacement (TSER). Each objective is specifically designed to target a distinct dimension of temporal information: ETAMLM enhances the model's understanding of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · WordPiece · Linear Warmup With Linear Decay · Weight Decay · Attention Dropout · Linear Layer · Adam · Attention Is All You Need · Residual Connection · Multi-Head Attention