Coherence and Diversity through Noise: Self-Supervised Paraphrase   Generation via Structure-Aware Denoising

Rishabh Gupta; Venktesh V.; Mukesh Mohania; Vikram Goyal

arXiv:2302.02780·cs.CL·February 7, 2023

Coherence and Diversity through Noise: Self-Supervised Paraphrase Generation via Structure-Aware Denoising

Rishabh Gupta, Venktesh V., Mukesh Mohania, Vikram Goyal

PDF

Open Access

TL;DR

This paper introduces SCANING, an unsupervised framework for paraphrasing algebraic word problems by controlled noise injection, enhancing diversity and semantic preservation to improve educational applications and reduce plagiarism.

Contribution

The paper presents a novel structure-aware denoising approach for unsupervised paraphrasing, specifically tailored for complex algebraic word problems, addressing limitations of existing methods.

Findings

01

Significant improvement in semantic preservation and diversity over baselines.

02

Effective handling of complex, lengthy algebraic problems.

03

Validated across four datasets with extensive evaluations.

Abstract

In this paper, we propose SCANING, an unsupervised framework for paraphrasing via controlled noise injection. We focus on the novel task of paraphrasing algebraic word problems having practical applications in online pedagogy as a means to reduce plagiarism as well as ensure understanding on the part of the student instead of rote memorization. This task is more complex than paraphrasing general-domain corpora due to the difficulty in preserving critical information for solution consistency of the paraphrased word problem, managing the increased length of the text and ensuring diversity in the generated paraphrase. Existing approaches fail to demonstrate adequate performance on at least one, if not all, of these facets, necessitating the need for a more comprehensive solution. To this end, we model the noising search space as a composition of contextual and syntactic aspects and sample…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

Methodsfail