A Quantum Search Decoder for Natural Language Processing

Johannes Bausch; Sathyawageeswar Subramanian; Stephen Piddock

arXiv:1909.05023·quant-ph·May 3, 2021

A Quantum Search Decoder for Natural Language Processing

Johannes Bausch, Sathyawageeswar Subramanian, Stephen Piddock

PDF

TL;DR

This paper introduces a quantum algorithm for decoding language models that finds globally optimal predictions more efficiently than classical methods, especially for power-law distributed inputs, and demonstrates practical speedups in speech recognition.

Contribution

It develops a quantum decoding algorithm that surpasses classical beam search in speed and accuracy, particularly for power-law distributed language data.

Findings

01

Quantum decoder achieves more than quadratic speedup over classical methods.

02

Modified quantum algorithm performs well with finite beam width, maintaining high accuracy.

03

Application to speech recognition shows the model's power-law distribution enables practical quantum speedups.

Abstract

Probabilistic language models, e.g. those based on an LSTM, often face the problem of finding a high probability prediction from a sequence of random variables over a set of tokens. This is commonly addressed using a form of greedy decoding such as beam search, where a limited number of highest-likelihood paths (the beam width) of the decoder are kept, and at the end the maximum-likelihood path is chosen. In this work, we construct a quantum algorithm to find the globally optimal parse (i.e. for infinite beam width) with high constant success probability. When the input to the decoder is distributed as a power-law with exponent $k > 0$ , our algorithm has runtime $R^{n f (R, k)}$ , where $R$ is the alphabet size, $n$ the input length; here $f < 1/2$ , and $f \to 0$ exponentially fast with increasing $k$ , hence making our algorithm always more than quadratically faster than its classical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory