Learning to Predict Concept Ordering for Common Sense Generation
Tianhui Zhang, Danushka Bollegala, Bei Peng

TL;DR
This paper systematically studies how the order of input concepts affects the quality of commonsense sentence generation across various language models, highlighting the importance of concept ordering and human reordering.
Contribution
It introduces a comprehensive analysis of concept ordering strategies, demonstrating the effectiveness of human reordering and identifying the superior performance of BART-large in this task.
Findings
BART-large outperforms other models with concept ordering from training data
Larger GPT-3 models do not necessarily outperform smaller models on this task
Human reordering of concepts yields the best sentence generation results
Abstract
Prior work has shown that the ordering in which concepts are shown to a commonsense generator plays an important role, affecting the quality of the generated sentence. However, it remains a challenge to determine the optimal ordering of a given set of concepts such that a natural sentence covering all the concepts could be generated from a pretrained generator. To understand the relationship between the ordering of the input concepts and the quality of the generated sentences, we conduct a systematic study considering multiple language models (LMs) and concept ordering strategies. We find that BART-large model consistently outperforms all other LMs considered in this study when fine-tuned using the ordering of concepts as they appear in CommonGen training data as measured using multiple evaluation metrics. Moreover, the larger GPT3-based large language models (LLMs) variants do not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
