Leap+Verify: Regime-Adaptive Speculative Weight Prediction for Accelerating Neural Network Training

Jeremy McEntire

arXiv:2602.19580·cs.LG·February 24, 2026

Leap+Verify: Regime-Adaptive Speculative Weight Prediction for Accelerating Neural Network Training

Jeremy McEntire

PDF

Open Access

TL;DR

Leap+Verify introduces a regime-adaptive speculative execution framework that predicts neural network weights during training, validated by loss criteria, to accelerate training especially in predictable regimes, with varying success across model scales.

Contribution

This work proposes a novel regime-aware speculative weight prediction method for neural network training, inspired by speculative decoding and ASC architecture, with empirical validation on large language models.

Findings

01

Finite-difference predictors improve acceptance rates over momentum-based methods.

02

Larger models exhibit more predictable regimes but are less often predictable overall.

03

The three-regime framework consistently identifies phase boundaries across different training seeds.

Abstract

We introduce Leap+Verify, a framework that applies speculative execution -- predicting future model weights and validating predictions before acceptance -- to accelerate neural network training. Inspired by speculative decoding in language model inference and by the Automatically Scalable Computation (ASC) architecture for program execution, Leap+Verify decomposes training into three dynamically detected regimes (chaotic, transition, stable) using activation-space cosine similarity as a real-time Lyapunov proxy signal. Within each regime, analytic weight predictors (momentum, linear, quadratic extrapolation) attempt to forecast model parameters K training steps ahead; predictions are accepted only when validated against a held-out loss criterion. We evaluate Leap+Verify on GPT-2 124M and Qwen 2.5-1.5B trained on WikiText-103 across five random seeds, sweeping prediction depth K in {5,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning in Materials Science · Topic Modeling