Neural Networks for Text Correction and Completion in Keyboard Decoding
Shaona Ghosh, Per Ola Kristensson

TL;DR
This paper introduces a neural attention-based sequence-to-sequence model for text correction and completion tailored for resource-constrained devices, achieving high accuracy with a small dataset and low memory footprint.
Contribution
It presents a novel neural network architecture combining CNN and GRU for efficient text correction and completion on limited-resource devices, with a new noisy-to-corrected dataset derived from Twitter data.
Findings
Achieved 90% word accuracy and 2.4% CER on Twitter typo dataset.
Developed a smaller, efficient model with an order of magnitude less memory footprint.
Reported 98% word accuracy on a new noisy-to-corrected dataset.
Abstract
Despite the ubiquity of mobile and wearable text messaging applications, the problem of keyboard text decoding is not tackled sufficiently in the light of the enormous success of the deep learning Recurrent Neural Network (RNN) and Convolutional Neural Networks (CNN) for natural language understanding. In particular, considering that the keyboard decoders should operate on devices with memory and processor resource constraints, makes it challenging to deploy industrial scale deep neural network (DNN) models. This paper proposes a sequence-to-sequence neural attention network system for automatic text correction and completion. Given an erroneous sequence, our model encodes character level hidden representations and then decodes the revised sequence thus enabling auto-correction and completion. We achieve this by a combination of character level CNN and gated recurrent unit (GRU) encoder…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Speech Recognition and Synthesis · Topic Modeling
