Predictive Sparse Manifold Transform
Yujia Xie, Xinhui Li, Vince D. Calhoun

TL;DR
The paper introduces PSMT, a biologically plausible framework that learns and predicts natural dynamics using sparse coding and manifold learning, demonstrating improved future frame prediction in videos.
Contribution
It presents a novel two-layer framework combining sparse coding and manifold learning for dynamic prediction, with a focus on interpretability and biological plausibility.
Findings
PSMT outperforms static embedding baselines in future frame prediction.
Dynamic embedding space improves prediction accuracy.
The framework effectively captures topological and temporal structures in data.
Abstract
We present Predictive Sparse Manifold Transform (PSMT), a minimalistic, interpretable and biologically plausible framework for learning and predicting natural dynamics. PSMT incorporates two layers where the first sparse coding layer represents the input sequence as sparse coefficients over an overcomplete dictionary and the second manifold learning layer learns a geometric embedding space that captures topological similarity and dynamic temporal linearity in sparse coefficients. We apply PSMT on a natural video dataset and evaluate the reconstruction performance with respect to contextual variability, the number of sparse coding basis functions and training samples. We then interpret the dynamic topological organization in the embedding space. We next utilize PSMT to predict future frames compared with two baseline methods with a static embedding space. We demonstrate that PSMT with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · Video Surveillance and Tracking Methods
