Alternators For Sequence Modeling

Mohammad Reza Rezaei; Adji Bousso Dieng

arXiv:2405.11848·stat.ML·December 3, 2024

Alternators For Sequence Modeling

Mohammad Reza Rezaei, Adji Bousso Dieng

PDF

Open Access 1 Repo

TL;DR

This paper presents alternators, a new non-Markovian sequence model with two neural networks that alternate outputs, capable of modeling complex dynamics, forecasting, and data imputation across diverse scientific domains.

Contribution

Introduces alternators, a novel sequence modeling framework with alternating neural networks, improving stability, sampling speed, and performance over existing models.

Findings

01

Successfully modeled chaotic Lorenz dynamics.

02

Achieved accurate brain activity to physical activity mapping.

03

Improved sea-surface temperature forecasting.

Abstract

This paper introduces alternators, a novel family of non-Markovian dynamical models for sequences. An alternator features two neural networks: the observation trajectory network (OTN) and the feature trajectory network (FTN). The OTN and the FTN work in conjunction, alternating between outputting samples in the observation space and some feature space, respectively, over a cycle. The parameters of the OTN and the FTN are not time-dependent and are learned via a minimum cross-entropy criterion over the trajectories. Alternators are versatile. They can be used as dynamical latent-variable generative models or as sequence-to-sequence predictors. Alternators can uncover the latent dynamics underlying complex sequential data, accurately forecast and impute missing data, and sample new trajectories. We showcase the capabilities of alternators in three applications. We first used alternators…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vertaix/Alternators
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression

MethodsDiffusion