Good things come in three: Generating SO Post Titles with Pre-Trained Models, Self Improvement and Post Ranking
Duc Anh Le, Anh M. T. Bui, Phuong T. Nguyen, Davide Di Ruscio

TL;DR
This paper introduces FILLER, a novel approach for generating high-quality Stack Overflow post titles by fine-tuning pre-trained language models with self-improvement and post ranking, significantly outperforming existing baselines.
Contribution
The study presents a new training paradigm incorporating self-improvement and post ranking to enhance title generation quality for Stack Overflow posts.
Findings
FILLER outperforms baseline models like Code2Que and GPT3.5-turbo.
The model generates more relevant and high-quality titles according to empirical evaluation.
User studies confirm the superiority of FILLER in relevance and quality.
Abstract
Stack Overflow is a prominent Q and A forum, supporting developers in seeking suitable resources on programming-related matters. Having high-quality question titles is an effective means to attract developers' attention. Unfortunately, this is often underestimated, leaving room for improvement. Research has been conducted, predominantly leveraging pre-trained models to generate titles from code snippets and problem descriptions. Yet, getting high-quality titles is still a challenging task, attributed to both the quality of the input data (e.g., containing noise and ambiguity) and inherent constraints in sequence generation models. In this paper, we present FILLER as a solution to generating Stack Overflow post titles using a fine-tuned language model with self-improvement and post ranking. Our study focuses on enhancing pre-trained language models for generating titles for Stack…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Biomedical Text Mining and Ontologies
