Metadata Might Make Language Models Better

Kaspar Beelen; Daniel van Strien

arXiv:2211.10086 · cs.CL · November 21, 2022

Open Access · 4 Models · 1 Dataset

TL;DR

Showing temporal, political, and geographical metadata to language models trained on historical texts has a beneficial impact and may produce more robust and fairer models, as shown in experiments on 19th-century newspapers.

Contribution

This study extends the time-masking approach of Rosin et al. (2022) by systematically comparing strategies for inserting temporal, political, and geographical metadata into masked language models trained on historical data.

Findings

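01

Showing relevant metadata to a model may make it more robust.

02

Metadata may also produce fairer language models.

03

Models fine-tuned on metadata-enhanced input benefit across the evaluation tasks: pseudo-perplexity, metadata mask-filling, and supervised classification.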

Abstract

This paper discusses the benefits of including metadata when training language models on historical collections. Using 19th-century newspapers as a case study, we extend the time-masking approach proposed by Rosin et al. (2022) and compare different strategies for inserting temporal, political and geographical information into a Masked Language Model. After fine-tuning several DistilBERT models on enhanced input data, we provide a systematic evaluation of these models on a set of evaluation tasks: pseudo-perplexity, metadata mask-filling and supervised classification. We find that showing relevant metadata to a language model has a beneficial impact and may even produce more robust and fairer models.
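
The ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the input layout (metadata fields prepended to the text and joined with "[SEP]"), the helper names add_metadata, mask_metadata, and pseudo_perplexity, and the example sentence are all assumptions made for illustration. The sketch shows how metadata might be inserted into the input, how one metadata field can be hidden for a mask-filling task, and how pseudo-perplexity follows from per-token log-probabilities that a masked language model would supply.

```python
import math


def add_metadata(text, year=None, politics=None, place=None, sep="[SEP]"):
    """Prepend whichever metadata fields are available to the text.

    The field order and the separator are illustrative assumptions,
    not the format used in the paper.
    """
    fields = [str(v) for v in (year, politics, place) if v is not None]
    return f" {sep} ".join(fields + [text])


def mask_metadata(example, value, mask_token="[MASK]"):
    """Hide the first occurrence of a metadata value behind the mask token.

    Because metadata is prepended, the first occurrence is the metadata
    field itself as long as earlier fields do not contain the same string.
    """
    return example.replace(str(value), mask_token, 1)


def pseudo_perplexity(token_log_probs):
    """Exponential of the negative mean log-probability.

    Each entry is assumed to be the natural-log probability a masked
    language model assigns to the true token when only that token is masked.
    """
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


# Hypothetical example sentence and metadata values.
enhanced = add_metadata("The harvest was poor this year.", year=1865, politics="liberal")
masked = mask_metadata(enhanced, 1865)
print(enhanced)
print(masked)
```

In practice the log-probabilities would come from a fine-tuned model such as DistilBERT, masking one token at a time; lower pseudo-perplexity indicates that the model predicts the held-out tokens more confidently.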

Code & Models

Datasets

davanstrien/testgitupload
dataset · 14 dl

Taxonomy

Topics: Computational and Text Analysis Methods · Natural Language Processing Techniques

Methods: Multi-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · WordPiece · Softmax