Neural Search Space in Gboard Decoder
Yanxiang Zhang, Yuanbo Zhang, Haicheng Sun, Yun Wang, Billy Dou, Gary, Sivek, Shumin Zhai

TL;DR
This paper introduces Neural Search Space, replacing N-gram language models with neural network models in Gboard decoding, leading to improved suggestion quality by incorporating long-range context.
Contribution
It proposes a novel method to integrate neural network language models into the Gboard decoder's search space dynamically at runtime, overcoming N-gram limitations.
Findings
Reduced Words Modified Ratio by up to 1.19% across locales
Improved suggestion quality with acceptable latency increase
Redesigned FST structure for neural network integration
Abstract
Gboard Decoder produces suggestions by looking for paths that best match input touch points on the context aware search space, which is backed by the language Finite State Transducers (FST). The language FST is currently an N-gram language model (LM). However, N-gram LMs, limited in context length, are known to have sparsity problem under device model size constraint. In this paper, we propose \textbf{Neural Search Space} which substitutes the N-gram LM with a Neural Network LM (NN-LM) and dynamically constructs the search space during decoding. Specifically, we integrate the long range context awareness of NN-LM into the search space by converting its outputs given context, into the language FST at runtime. This involves language FST structure redesign, pruning strategy tuning, and data structure optimizations. Online experiments demonstrate improved quality results, reducing Words…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Advanced Numerical Analysis Techniques · Advanced Computational Techniques and Applications
MethodsAttentive Walk-Aggregating Graph Neural Network · Pruning
