TL;DR
This paper explores the application of the T5 model, pre-trained on combined natural language and source code data, to enhance performance in various code-related tasks such as bug fixing and code comment generation.
Contribution
It demonstrates that pre-training T5 on mixed data improves its effectiveness across multiple software engineering tasks compared to previous models.
Findings
T5 outperforms baseline models in bug fixing.
Pre-training on combined data enhances code task performance.
Single T5 model achieves improvements across four tasks.
Abstract
Deep learning (DL) techniques are gaining more and more attention in the software engineering community. They have been used to support several code-related tasks, such as automatic bug fixing and code comments generation. Recent studies in the Natural Language Processing (NLP) field have shown that the Text-To-Text Transfer Transformer (T5) architecture can achieve state-of-the-art performance for a variety of NLP tasks. The basic idea behind T5 is to first pre-train a model on a large and generic dataset using a self-supervised task ( e.g: filling masked words in sentences). Once the model is pre-trained, it is fine-tuned on smaller and specialized datasets, each one related to a specific task ( e.g: language translation, sentence classification). In this paper, we empirically investigate how the T5 model performs when pre-trained and fine-tuned to support code-related tasks. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
