TL;DR
This paper introduces two novel SE-specific out-of-distribution detection models for deep code models, enhancing their trustworthiness by effectively identifying data that differs from training distributions, with significant experimental improvements.
Contribution
The paper proposes unsupervised and weakly-supervised OOD detection methods tailored for software engineering tasks, addressing the challenge of distribution shifts in deep code models.
Findings
Proposed methods outperform baselines in four OOD scenarios.
Enhanced OOD detection improves main code understanding tasks.
Models effectively handle distribution shifts in real-world settings.
Abstract
Numerous machine learning (ML) models have been developed, including those for software engineering (SE) tasks, under the assumption that training and testing data come from the same distribution. However, training and testing distributions often differ, as training datasets rarely encompass the entire distribution, while testing distribution tends to shift over time. Hence, when confronted with out-of-distribution (OOD) instances that differ from the training data, a reliable and trustworthy SE ML model must be capable of detecting them to either abstain from making predictions, or potentially forward these OODs to appropriate models handling other categories or tasks. In this paper, we develop two types of SE-specific OOD detection models, unsupervised and weakly-supervised OOD detection for code. The unsupervised OOD detection approach is trained solely on in-distribution samples…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
