Circuit-Based Intrinsic Methods to Detect Overfitting
Satrajit Chatterjee, Alan Mishchenko

TL;DR
This paper introduces Counterfactual Simulation (CFS), an intrinsic, model-agnostic method to detect overfitting by analyzing training data flow through models like neural networks and random forests, without relying on test data.
Contribution
The paper proposes CFS, a hyper-parameter-free intrinsic method applicable across various models, providing new insights into neural network generalization and overfitting detection.
Findings
CFS effectively distinguishes models with different overfitting levels.
Neural networks tend to find common patterns rather than brute-force memorization.
Insights into neural network generalization compared to random forests.
Abstract
The focus of this paper is on intrinsic methods to detect overfitting. By intrinsic methods, we mean methods that rely only on the model and the training data, as opposed to traditional methods (we call them extrinsic methods) that rely on performance on a test set or on bounds from model complexity. We propose a family of intrinsic methods, called Counterfactual Simulation (CFS), which analyze the flow of training examples through the model by identifying and perturbing rare patterns. By applying CFS to logic circuits we get a method that has no hyper-parameters and works uniformly across different types of models such as neural networks, random forests and lookup tables. Experimentally, CFS can separate models with different levels of overfit using only their logic circuit representations without any access to the high level structure. By comparing lookup tables, neural networks, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · Machine Learning and Data Classification
