Alternators For Sequence Modeling
Mohammad Reza Rezaei, Adji Bousso Dieng

TL;DR
This paper presents alternators, a new non-Markovian sequence model with two neural networks that alternate outputs, capable of modeling complex dynamics, forecasting, and data imputation across diverse scientific domains.
Contribution
Introduces alternators, a novel sequence modeling framework with alternating neural networks, improving stability, sampling speed, and performance over existing models.
Findings
Successfully modeled chaotic Lorenz dynamics.
Achieved accurate brain activity to physical activity mapping.
Improved sea-surface temperature forecasting.
Abstract
This paper introduces alternators, a novel family of non-Markovian dynamical models for sequences. An alternator features two neural networks: the observation trajectory network (OTN) and the feature trajectory network (FTN). The OTN and the FTN work in conjunction, alternating between outputting samples in the observation space and some feature space, respectively, over a cycle. The parameters of the OTN and the FTN are not time-dependent and are learned via a minimum cross-entropy criterion over the trajectories. Alternators are versatile. They can be used as dynamical latent-variable generative models or as sequence-to-sequence predictors. Alternators can uncover the latent dynamics underlying complex sequential data, accurately forecast and impute missing data, and sample new trajectories. We showcase the capabilities of alternators in three applications. We first used alternators…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression
MethodsDiffusion
