Learning Sequential Latent Variable Models from Multimodal Time Series Data
Oliver Limoyo, Trevor Ablett, and Jonathan Kelly

TL;DR
This paper introduces a self-supervised probabilistic framework for learning latent dynamics models from multimodal sequential data, improving prediction and representation quality in robotic tasks.
Contribution
It extends latent dynamics models to effectively incorporate multimodal data using a novel self-supervised generative approach, outperforming simple concatenation baselines.
Findings
Significant improvements in prediction accuracy.
Better representation quality compared to concatenation baseline.
Nearly as effective as supervised methods without labels.
Abstract
Sequential modelling of high-dimensional data is an important problem that appears in many domains including model-based reinforcement learning and dynamics identification for control. Latent variable models applied to sequential data (i.e., latent dynamics models) have been shown to be a particularly effective probabilistic approach to solve this problem, especially when dealing with images. However, in many application areas (e.g., robotics), information from multiple sensing modalities is available -- existing latent dynamics methods have not yet been extended to effectively make use of such multimodal sequential data. Multimodal sensor streams can be correlated in a useful manner and often contain complementary information across modalities. In this work, we present a self-supervised generative modelling framework to jointly learn a probabilistic latent state representation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Gaussian Processes and Bayesian Inference · Data Visualization and Analytics
