Is It Worth the (Environmental) Cost? Limited Evidence for Temporal   Adaptation via Continuous Training

Giuseppe Attanasio; Debora Nozza; Federico Bianchi; Dirk Hovy

arXiv:2210.07365·cs.CL·May 5, 2023·1 cites

Is It Worth the (Environmental) Cost? Limited Evidence for Temporal Adaptation via Continuous Training

Giuseppe Attanasio, Debora Nozza, Federico Bianchi, Dirk Hovy

PDF

Open Access

TL;DR

This study evaluates whether continuous training of language models for social media data improves performance over time, finding that it often does not justify the environmental costs involved.

Contribution

The paper provides empirical evidence questioning the benefits of temporal adaptation in language models and highlights the need for suitable benchmarks and sustainable practices.

Findings

01

Pretrained models outperform temporally adapted models in social media tasks.

02

Continuous training does not significantly improve downstream task performance.

03

Current benchmarks are insufficient for evaluating temporal adaptation.

Abstract

Language is constantly changing and evolving, leaving language models to become quickly outdated. Consequently, we should continuously update our models with new data to expose them to new events and facts. However, that requires additional computing, which means new carbon emissions. Do any measurable benefits justify this cost? This paper looks for empirical evidence to support continuous training. We reproduce existing benchmarks and extend them to include additional time periods, models, and tasks. Our results show that the downstream task performance of temporally adapted English models for social media data do not improve over time. Pretrained models without temporal adaptation are actually significantly more effective and efficient. However, we also note a lack of suitable temporal benchmarks. Our findings invite a critical reflection on when and how to temporally adapt language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Expert finding and Q&A systems