Tensor Networks for Probabilistic Sequence Modeling
Jacob Miller, Guillaume Rabusseau, John Terilla

TL;DR
This paper explores the use of tensor networks, specifically uniform matrix product states, for probabilistic sequence modeling, enabling efficient sequence evaluation and novel structured data generation.
Contribution
It introduces a u-MPS-based probabilistic model with sequence-level parallelism and a new generative algorithm for complex conditional distributions, expanding the capabilities of tensor network models.
Findings
u-MPS enable sequence evaluation in O(log n) depth
The generative algorithm can produce richly structured data
u-MPS outperform baselines on synthetic and real text data
Abstract
Tensor networks are a powerful modeling framework developed for computational many-body physics, which have only recently been applied within machine learning. In this work we utilize a uniform matrix product state (u-MPS) model for probabilistic modeling of sequence data. We first show that u-MPS enable sequence-level parallelism, with length-n sequences able to be evaluated in depth O(log n). We then introduce a novel generative algorithm giving trained u-MPS the ability to efficiently sample from a wide variety of conditional distributions, each one defined by a regular expression. Special cases of this algorithm correspond to autoregressive and fill-in-the-blank sampling, but more complex regular expressions permit the generation of richly structured data in a manner that has no direct analogue in neural generative models. Experiments on sequence modeling with synthetic and real…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Quantum many-body systems · Parallel Computing and Optimization Techniques
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
