Metadata Conditioned Large Language Models for Localization

Anjishnu Mukherjee; Ziwei Zhu; Antonios Anastasopoulos

arXiv:2601.15236·cs.CL·January 22, 2026

Metadata Conditioned Large Language Models for Localization

Anjishnu Mukherjee, Ziwei Zhu, Antonios Anastasopoulos

PDF

Open Access

TL;DR

This paper demonstrates that metadata conditioning in large language models enhances localization performance, enabling models to adapt to regional differences effectively without extensive region-specific training.

Contribution

The study introduces a lightweight metadata conditioning method for large language models, improving localization and efficiency across regions using annotated news data.

Findings

01

Metadata conditioning improves in-region performance.

02

Models recover localization comparable to region-specific models.

03

Metadata conditioning enhances learning efficiency.

Abstract

Large language models are typically trained by treating text as a single global distribution, often resulting in geographically homogenized behavior. We study metadata conditioning as a lightweight approach for localization, pre-training 31 models (at 0.5B and 1B parameter scales) from scratch on large-scale English news data annotated with verified URLs, country tags, and continent tags, covering 4 continents and 17 countries. Across four controlled experiments, we show that metadata conditioning consistently improves in-region performance without sacrificing cross-region generalization, enables global models to recover localization comparable to region-specific models, and improves learning efficiency. Our ablation studies demonstrate that URL-level metadata alone captures much of the geographic signal, while balanced regional data coverage remains essential, as metadata cannot fully…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning