From system models to class models: An in-context learning paradigm

Marco Forgione; Filippo Pura; Dario Piga

arXiv:2308.13380·eess.SY·December 21, 2023

From system models to class models: An in-context learning paradigm

Marco Forgione, Filippo Pura, Dario Piga

PDF

Open Access 5 Repos

TL;DR

This paper introduces a novel in-context learning paradigm using Transformer models to identify and predict the behavior of dynamical systems by learning a meta model from synthetic data, rather than directly modeling individual systems.

Contribution

It presents a new approach to system identification that leverages Transformers for in-context learning, enabling predictions based on system class behavior without explicit system modeling.

Findings

01

Meta model effectively predicts system behavior from limited context.

02

Transformer-based approach outperforms traditional system identification methods.

03

Experimental results validate the feasibility of in-context learning for dynamical systems.

Abstract

Is it possible to understand the intricacies of a dynamical system not solely from its input/output pattern, but also by observing the behavior of other systems within the same class? This central question drives the study presented in this paper. In response to this query, we introduce a novel paradigm for system identification, addressing two primary tasks: one-step-ahead prediction and multi-step simulation. Unlike conventional methods, we do not directly estimate a model for the specific system. Instead, we learn a meta model that represents a class of dynamical systems. This meta model is trained on a potentially infinite stream of synthetic data, generated by simulators whose settings are randomly extracted from a probability distribution. When provided with a context from a new system-specifically, an input/output sequence-the meta model implicitly discerns its dynamics,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTime Series Analysis and Forecasting · Anomaly Detection Techniques and Applications · Neural Networks and Applications

MethodsAttention Is All You Need · Linear Layer · Dropout · Byte Pair Encoding · Adam · Position-Wise Feed-Forward Layer · Multi-Head Attention · Absolute Position Encodings · Residual Connection · Label Smoothing