Neural Networks for Text Correction and Completion in Keyboard Decoding

Shaona Ghosh; Per Ola Kristensson

arXiv:1709.06429·cs.CL·September 20, 2017·56 cites

Neural Networks for Text Correction and Completion in Keyboard Decoding

Shaona Ghosh, Per Ola Kristensson

PDF

Open Access 1 Models

TL;DR

This paper introduces a neural attention-based sequence-to-sequence model for text correction and completion tailored for resource-constrained devices, achieving high accuracy with a small dataset and low memory footprint.

Contribution

It presents a novel neural network architecture combining CNN and GRU for efficient text correction and completion on limited-resource devices, with a new noisy-to-corrected dataset derived from Twitter data.

Findings

01

Achieved 90% word accuracy and 2.4% CER on Twitter typo dataset.

02

Developed a smaller, efficient model with an order of magnitude less memory footprint.

03

Reported 98% word accuracy on a new noisy-to-corrected dataset.

Abstract

Despite the ubiquity of mobile and wearable text messaging applications, the problem of keyboard text decoding is not tackled sufficiently in the light of the enormous success of the deep learning Recurrent Neural Network (RNN) and Convolutional Neural Networks (CNN) for natural language understanding. In particular, considering that the keyboard decoders should operate on devices with memory and processor resource constraints, makes it challenging to deploy industrial scale deep neural network (DNN) models. This paper proposes a sequence-to-sequence neural attention network system for automatic text correction and completion. Given an erroneous sequence, our model encodes character level hidden representations and then decodes the revised sequence thus enabling auto-correction and completion. We achieve this by a combination of character level CNN and gated recurrent unit (GRU) encoder…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
AGofficial/NeuralCode
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech Recognition and Synthesis · Topic Modeling