CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing
Ahmed Elnaggar, Wei Ding, Llion Jones, Tom Gibbs, Tamas Feher,, Christoph Angerer, Silvia Severini, Florian Matthes, Burkhard Rost

TL;DR
CodeTrans is a transformer-based model designed for multiple software engineering tasks, leveraging self-supervised learning and high-performance computing to outperform existing models and facilitate future research.
Contribution
The paper introduces CodeTrans, a novel encoder-decoder transformer model tailored for software engineering tasks, with comprehensive training strategies and publicly available pre-trained models.
Findings
CodeTrans outperforms state-of-the-art models on all tested tasks.
Different training strategies significantly impact model performance.
Pre-trained models are publicly available for future research.
Abstract
Currently, a growing number of mature natural language processing applications make people's life more convenient. Such applications are built by source code - the language in software engineering. However, the applications for understanding source code language to ease the software engineering process are under-researched. Simultaneously, the transformer model, especially its combination with transfer learning, has been proven to be a powerful technique for natural language processing tasks. These breakthroughs point out a promising direction for process source code and crack software engineering tasks. This paper describes CodeTrans - an encoder-decoder transformer model for tasks in the software engineering domain, that explores the effectiveness of encoder-decoder transformer models for six software engineering tasks, including thirteen sub-tasks. Moreover, we have investigated the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Reliability and Analysis Research · Software Engineering Techniques and Practices
MethodsLinear Layer · Byte Pair Encoding · Gated Linear Unit · Refunds@Expedia|||How do I get a full refund from Expedia? · Inverse Square Root Schedule · Adafactor · Dense Connections · Softmax · Attention Dropout · Dropout
