Improving Logical-Level Natural Language Generation with Topic-Conditioned Data Augmentation and Logical Form Generation
Ao Liu, Congjian Luo, Naoaki Okazaki

TL;DR
This paper enhances logical-level natural language generation by using GPT-2 for data augmentation and introducing a dual logical form generation task, significantly improving performance with semi-supervised training.
Contribution
It proposes topic-conditioned data augmentation and a logical form generation task, enabling semi-supervised learning to improve Logic2text models without extensive manual annotations.
Findings
Outperforms supervised baselines on Logic2text dataset
Effective utilization of augmented data improves generation quality
Semi-supervised approach benefits from back-translation signals
Abstract
Logical Natural Language Generation, i.e., generating textual descriptions that can be logically entailed by a structured table, has been a challenge due to the low fidelity of the generation. \citet{chen2020logic2text} have addressed this problem by annotating interim logical programs to control the generation contents and semantics, and presented the task of table-aware logical form to text (Logic2text) generation. However, although table instances are abundant in the real world, logical forms paired with textual descriptions require costly human annotation work, which limits the performance of neural models. To mitigate this, we propose topic-conditioned data augmentation (TopicDA), which utilizes GPT-2 to generate unpaired logical forms and textual descriptions directly from tables. We further introduce logical form generation (LG), a dual task of Logic2text that requires generating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsAttention Is All You Need · Linear Layer · Adam · Multi-Head Attention · Cosine Annealing · Residual Connection · Dense Connections · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Linear Warmup With Cosine Annealing
