Studying the Usage of Text-To-Text Transfer Transformer to Support   Code-Related Tasks

Antonio Mastropaolo; Simone Scalabrino; Nathan Cooper; David Nader; Palacio; Denys Poshyvanyk; Rocco Oliveto; Gabriele Bavota

arXiv:2102.02017·cs.SE·February 4, 2021

Studying the Usage of Text-To-Text Transfer Transformer to Support Code-Related Tasks

Antonio Mastropaolo, Simone Scalabrino, Nathan Cooper, David Nader, Palacio, Denys Poshyvanyk, Rocco Oliveto, Gabriele Bavota

PDF

2 Repos

TL;DR

This paper explores the application of the T5 model, pre-trained on combined natural language and source code data, to enhance performance in various code-related tasks such as bug fixing and code comment generation.

Contribution

It demonstrates that pre-training T5 on mixed data improves its effectiveness across multiple software engineering tasks compared to previous models.

Findings

01

T5 outperforms baseline models in bug fixing.

02

Pre-training on combined data enhances code task performance.

03

Single T5 model achieves improvements across four tasks.

Abstract

Deep learning (DL) techniques are gaining more and more attention in the software engineering community. They have been used to support several code-related tasks, such as automatic bug fixing and code comments generation. Recent studies in the Natural Language Processing (NLP) field have shown that the Text-To-Text Transfer Transformer (T5) architecture can achieve state-of-the-art performance for a variety of NLP tasks. The basic idea behind T5 is to first pre-train a model on a large and generic dataset using a self-supervised task ( e.g: filling masked words in sentences). Once the model is pre-trained, it is fine-tuned on smaller and specialized datasets, each one related to a specific task ( e.g: language translation, sentence classification). In this paper, we empirically investigate how the T5 model performs when pre-trained and fine-tuned to support code-related tasks. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.