TimeLMs: Diachronic Language Models from Twitter
Daniel Loureiro, Francesco Barbieri, Leonardo Neves, Luis Espinosa, Anke, Jose Camacho-Collados

TL;DR
TimeLMs are specialized diachronic Twitter language models that leverage continual learning to better handle evolving language, trends, and out-of-distribution data, improving robustness and adaptability in social media NLP tasks.
Contribution
This paper introduces TimeLMs, a novel approach using continual learning for diachronic Twitter language modeling, addressing the gap of temporal dynamics in NLP.
Findings
Continual learning enhances Twitter language models' ability to handle future data.
TimeLMs perform competitively with standard benchmarks.
Qualitative analyses reveal how models adapt to trends and concept drift.
Abstract
Despite its importance, the time variable has been largely neglected in the NLP and language model literature. In this paper, we present TimeLMs, a set of language models specialized on diachronic Twitter data. We show that a continual learning strategy contributes to enhancing Twitter-based language models' capacity to deal with future and out-of-distribution tweets, while making them competitive with standardized and more monolithic benchmarks. We also perform a number of qualitative analyses showing how they cope with trends and peaks in activity involving specific named entities or concept drift.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗cardiffnlp/twitter-roberta-base-2019-90mmodel· 5 dl5 dl
- 🤗cardiffnlp/twitter-roberta-base-2021-124mmodel· 36 dl· ♡ 736 dl♡ 7
- 🤗cardiffnlp/twitter-roberta-base-dec2020model· 5 dl5 dl
- 🤗cardiffnlp/twitter-roberta-base-dec2021model· 69 dl69 dl
- 🤗cardiffnlp/twitter-roberta-base-jun2020model· 3 dl3 dl
- 🤗cardiffnlp/twitter-roberta-base-jun2021model· 3 dl3 dl
- 🤗cardiffnlp/twitter-roberta-base-mar2020model· 3 dl3 dl
- 🤗cardiffnlp/twitter-roberta-base-mar2021model· 3 dl3 dl
- 🤗cardiffnlp/twitter-roberta-base-sep2020model· 6 dl· ♡ 16 dl♡ 1
- 🤗cardiffnlp/twitter-roberta-base-sep2021model· 8 dl8 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Recommender Systems and Techniques · Caching and Content Delivery
