Temporal Effects on Pre-trained Models for Language Processing Tasks
Oshin Agarwal, Ani Nenkova

TL;DR
This paper investigates how the performance of pre-trained language models changes over time, distinguishing between model deterioration and domain adaptation, and finds that temporal adaptation improves task performance.
Contribution
It introduces a nuanced terminology for temporal effects, analyzes their impact on language tasks, and evaluates methods for temporal domain adaptation without human annotations.
Findings
Temporal model deterioration is not always a concern.
Temporal domain adaptation improves performance across tasks.
Self-labeling outperforms human annotations in named entity recognition.
Abstract
Keeping the performance of language technologies optimal as time passes is of great practical interest. We study temporal effects on model performance on downstream language tasks, establishing a nuanced terminology for such discussion and identifying factors essential to conduct a robust study. We present experiments for several tasks in English where the label correctness is not dependent on time and demonstrate the importance of distinguishing between temporal model deterioration and temporal domain adaptation for systems using pre-trained representations. We find that depending on the task, temporal model deterioration is not necessarily a concern. Temporal domain adaptation however is beneficial in all cases, with better performance for a given time period possible when the system is trained on temporally more recent data. Therefore, we also examine the efficacy of two approaches…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
