Out-of-Support Generalisation via Weight-Space Sequence Modelling

Roussel Desmond Nzoyem

arXiv:2602.13550·cs.LG·March 6, 2026

Out-of-Support Generalisation via Weight-Space Sequence Modelling

Roussel Desmond Nzoyem

PDF

Open Access

TL;DR

This paper introduces WeightCaster, a novel approach that models neural network weights as sequences to improve out-of-support generalisation, providing more reliable and interpretable predictions in safety-critical applications.

Contribution

The paper proposes a new sequence modelling framework in weight space for OoS generalisation, achieving competitive results without explicit inductive biases.

Findings

01

Outperforms state-of-the-art on synthetic and real-world datasets

02

Produces plausible and uncertainty-aware predictions

03

Maintains high computational efficiency

Abstract

As breakthroughs in deep learning transform key industries, models are increasingly required to extrapolate on datapoints found outside the range of the training set, a challenge we coin as out-of-support (OoS) generalisation. However, neural networks frequently exhibit catastrophic failure on OoS samples, yielding unrealistic but overconfident predictions. We address this challenge by reformulating the OoS generalisation problem as a sequence modelling task in the weight space, wherein the training set is partitioned into concentric shells corresponding to discrete sequential steps. Our WeightCaster framework yields plausible, interpretable, and uncertainty-aware predictions without necessitating explicit inductive biases, all the while maintaining high computational efficiency. Emprical validation on a synthetic cosine dataset and real-world air quality sensor readings demonstrates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAir Quality Monitoring and Forecasting · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications