Sentence Semantic Regression for Text Generation
Wei Wang, Piji Li, Hai-Tao Zheng

TL;DR
This paper introduces Sentence Semantic Regression (SSR), a novel two-phase text generation framework that improves idea reasoning and surface realization, leading to better consistency and performance in various generation tasks.
Contribution
The paper proposes SSR, a new sentence-level language modeling framework with autoregressive and bidirectional architectures, addressing issues like idea drift in text generation.
Findings
SSR outperforms baselines in automatic metrics
SSR achieves higher human evaluation scores
Effective across multiple text generation tasks
Abstract
Recall the classical text generation works, the generation framework can be briefly divided into two phases: \textbf{idea reasoning} and \textbf{surface realization}. The target of idea reasoning is to figure out the main idea which will be presented in the following talking/writing periods. Surface realization aims to arrange the most appropriate sentence to depict and convey the information distilled from the main idea. However, the current popular token-by-token text generation methods ignore this crucial process and suffer from many serious issues, such as idea/topic drift. To tackle the problems and realize this two-phase paradigm, we propose a new framework named Sentence Semantic Regression (\textbf{SSR}) based on sentence-level language modeling. For idea reasoning, two architectures \textbf{SSR-AR} and \textbf{SSR-NonAR} are designed to conduct sentence semantic regression…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
