Keeping Deep Learning Models in Check: A History-Based Approach to   Mitigate Overfitting

Hao Li; Gopi Krishnan Rajbahadur; Dayi Lin; Cor-Paul Bezemer; and Zhen; Ming (Jack) Jiang

arXiv:2401.10359·cs.SE·May 21, 2024·2 cites

Keeping Deep Learning Models in Check: A History-Based Approach to Mitigate Overfitting

Hao Li, Gopi Krishnan Rajbahadur, Dayi Lin, Cor-Paul Bezemer, and Zhen, Ming (Jack) Jiang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a history-based method using a time series classifier to detect and prevent overfitting in deep learning models, outperforming existing approaches in accuracy and early stopping capabilities.

Contribution

The authors propose a novel, simple approach that leverages training history to both detect and prevent overfitting without modifying model structure or requiring extensive resources.

Findings

01

Achieves an F1 score of 0.91, outperforming existing detection methods.

02

Can stop training at least 32% earlier than early stopping.

03

Maintains or improves the rate of returning the best model.

Abstract

In software engineering, deep learning models are increasingly deployed for critical tasks such as bug detection and code review. However, overfitting remains a challenge that affects the quality, reliability, and trustworthiness of software systems that utilize deep learning models. Overfitting can be (1) prevented (e.g., using dropout or early stopping) or (2) detected in a trained model (e.g., using correlation-based approaches). Both overfitting detection and prevention approaches that are currently used have constraints (e.g., requiring modification of the model structure, and high computing resources). In this paper, we propose a simple, yet powerful approach that can both detect and prevent overfitting based on the training history (i.e., validation losses). Our approach first trains a time series classifier on training histories of overfit models. This classifier is then used to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

asgaardlab/overfitguard
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications

MethodsEarly Stopping · Dropout