EvoLM: In Search of Lost Language Model Training Dynamics

Zhenting Qi; Fan Nie; Alexandre Alahi; James Zou; Himabindu Lakkaraju; Yilun Du; Eric Xing; Sham Kakade; and Hanlin Zhang

arXiv:2506.16029·cs.CL·November 19, 2025

EvoLM: In Search of Lost Language Model Training Dynamics

Zhenting Qi, Fan Nie, Alexandre Alahi, James Zou, Himabindu Lakkaraju, Yilun Du, Eric Xing, Sham Kakade, and Hanlin Zhang

PDF

Open Access

TL;DR

EvoLM provides a comprehensive suite for analyzing language model training dynamics across multiple stages, revealing key insights and trade-offs, and promoting transparency through open release of models and data.

Contribution

It introduces EvoLM, a systematic framework for analyzing LM training stages, and offers extensive empirical data and tools for understanding training impacts.

Findings

01

Diminishing returns from excessive pre-training and post-training.

02

Effective strategies for mitigating forgetting during domain-specific continued pre-training.

03

Continued pre-training is crucial for bridging pre-training and post-training phases.

Abstract

Modern language model (LM) training has been divided into multiple stages, making it difficult for downstream developers to evaluate the impact of design choices made at each stage. We present EvoLM, a model suite that enables systematic and transparent analysis of LMs' training dynamics across pre-training, continued pre-training, supervised fine-tuning, and reinforcement learning. We train over 100 LMs with 1B and 4B parameters from scratch, and evaluate both upstream (language modeling) and downstream (problem-solving) capabilities, including considerations of both in-domain and out-of-domain generalization. Key insights highlight the diminishing returns from excessive pre-training and post-training, the importance and practices of mitigating forgetting during domain-specific continued pre-training, the crucial role of continued pre-training in bridging pre-training and post-training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Natural Language Processing Techniques