Parallel Iterative Edit Models for Local Sequence Transduction

Abhijeet Awasthi; Sunita Sarawagi; Rasna Goyal; Sabyasachi Ghosh,; Vihari Piratla

arXiv:1910.02893·cs.CL·May 18, 2020·6 cites

Parallel Iterative Edit Models for Local Sequence Transduction

Abhijeet Awasthi, Sunita Sarawagi, Rasna Goyal, Sabyasachi Ghosh,, Vihari Piratla

PDF

Open Access 1 Repo

TL;DR

The paper introduces the Parallel Iterative Edit (PIE) model for local sequence transduction tasks, offering a faster alternative to traditional encoder-decoder models while maintaining competitive accuracy through iterative refinement and label prediction.

Contribution

The PIE model enables parallel decoding in local sequence transduction, combining iterative refinement and label prediction to achieve speed and accuracy improvements over encoder-decoder models.

Findings

01

PIE achieves accuracy comparable to encoder-decoder models.

02

PIE significantly reduces decoding time.

03

Effective across GEC, OCR, and spell correction tasks.

Abstract

We present a Parallel Iterative Edit (PIE) model for the problem of local sequence transduction arising in tasks like Grammatical error correction (GEC). Recent approaches are based on the popular encoder-decoder (ED) model for sequence to sequence learning. The ED model auto-regressively captures full dependency among output tokens but is slow due to sequential decoding. The PIE model does parallel decoding, giving up the advantage of modelling full dependency in the output, yet it achieves accuracy competitive with the ED model for four reasons: 1.~predicting edits instead of tokens, 2.~labeling sequences instead of generating sequences, 3.~iteratively refining predictions to capture dependencies, and 4.~factorizing logits over edits and their token argument to harness pre-trained language models like BERT. Experiments on tasks spanning GEC, OCR correction and spell correction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

awasthiabhijeet/PIE
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Algorithms and Data Compression

MethodsLinear Layer · Weight Decay · Residual Connection · Adam · Layer Normalization · Softmax · Attention Is All You Need · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention