Diverse Title Generation for Stack Overflow Posts with Multiple Sampling Enhanced Transformer
Fengji Zhang, Jin Liu, Yao Wan, Xiao Yu, Xiao Liu, Jacky Keung

TL;DR
This paper introduces M3NSCT5, a Transformer-based method that generates diverse, high-quality titles for Stack Overflow posts from code snippets, aiding developers in better problem description and retrieval.
Contribution
It proposes a novel maximal marginal multiple nucleus sampling strategy and a large-scale dataset, improving title diversity and quality over existing models.
Findings
M3NSCT5 outperforms six baseline models on BLEU and ROUGE metrics.
Human evaluation confirms the quality and diversity of generated titles.
The approach demonstrates potential for real-world application in programming communities.
Abstract
Stack Overflow is one of the most popular programming communities where developers can seek help for their encountered problems. Nevertheless, if inexperienced developers fail to describe their problems clearly, it is hard for them to attract sufficient attention and get the anticipated answers. We propose MNSCT5, a novel approach to automatically generate multiple post titles from the given code snippets. Developers may use the generated titles to find closely related posts and complete their problem descriptions. MNSCT5 employs the CodeT5 backbone, which is a pre-trained Transformer model having an excellent language understanding and generation ability. To alleviate the ambiguity issue that the same code snippets could be aligned with different titles under varying contexts, we propose the maximal marginal multiple nucleus sampling strategy to generate multiple high-quality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Engineering Techniques and Practices · Topic Modeling
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · SentencePiece · Inverse Square Root Schedule · Attention Dropout · Adafactor · Layer Normalization · Byte Pair Encoding · Gated Linear Unit
