Keeping Deep Learning Models in Check: A History-Based Approach to Mitigate Overfitting
Hao Li, Gopi Krishnan Rajbahadur, Dayi Lin, Cor-Paul Bezemer, and Zhen, Ming (Jack) Jiang

TL;DR
This paper introduces a history-based method using a time series classifier to detect and prevent overfitting in deep learning models, outperforming existing approaches in accuracy and early stopping capabilities.
Contribution
The authors propose a novel, simple approach that leverages training history to both detect and prevent overfitting without modifying model structure or requiring extensive resources.
Findings
Achieves an F1 score of 0.91, outperforming existing detection methods.
Can stop training at least 32% earlier than early stopping.
Maintains or improves the rate of returning the best model.
Abstract
In software engineering, deep learning models are increasingly deployed for critical tasks such as bug detection and code review. However, overfitting remains a challenge that affects the quality, reliability, and trustworthiness of software systems that utilize deep learning models. Overfitting can be (1) prevented (e.g., using dropout or early stopping) or (2) detected in a trained model (e.g., using correlation-based approaches). Both overfitting detection and prevention approaches that are currently used have constraints (e.g., requiring modification of the model structure, and high computing resources). In this paper, we propose a simple, yet powerful approach that can both detect and prevent overfitting based on the training history (i.e., validation losses). Our approach first trains a time series classifier on training histories of overfit models. This classifier is then used to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
MethodsEarly Stopping · Dropout
