From system models to class models: An in-context learning paradigm
Marco Forgione, Filippo Pura, Dario Piga

TL;DR
This paper introduces a novel in-context learning paradigm using Transformer models to identify and predict the behavior of dynamical systems by learning a meta model from synthetic data, rather than directly modeling individual systems.
Contribution
It presents a new approach to system identification that leverages Transformers for in-context learning, enabling predictions based on system class behavior without explicit system modeling.
Findings
Meta model effectively predicts system behavior from limited context.
Transformer-based approach outperforms traditional system identification methods.
Experimental results validate the feasibility of in-context learning for dynamical systems.
Abstract
Is it possible to understand the intricacies of a dynamical system not solely from its input/output pattern, but also by observing the behavior of other systems within the same class? This central question drives the study presented in this paper. In response to this query, we introduce a novel paradigm for system identification, addressing two primary tasks: one-step-ahead prediction and multi-step simulation. Unlike conventional methods, we do not directly estimate a model for the specific system. Instead, we learn a meta model that represents a class of dynamical systems. This meta model is trained on a potentially infinite stream of synthetic data, generated by simulators whose settings are randomly extracted from a probability distribution. When provided with a context from a new system-specifically, an input/output sequence-the meta model implicitly discerns its dynamics,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Anomaly Detection Techniques and Applications · Neural Networks and Applications
MethodsAttention Is All You Need · Linear Layer · Dropout · Byte Pair Encoding · Adam · Position-Wise Feed-Forward Layer · Multi-Head Attention · Absolute Position Encodings · Residual Connection · Label Smoothing
