Quantitative Overfitting Management for Human-in-the-loop ML Application Development with ease.ml/meter
Frances Ann Hubis, Wentao Wu, Ce Zhang

TL;DR
This paper introduces ease.ml/meter, a system that automatically detects and measures overfitting during ML development, providing probabilistic guarantees and guiding data size decisions to improve model quality management.
Contribution
The paper presents a novel system for automated overfitting detection with probabilistic guarantees, addressing challenges in continuous ML development and adaptive analysis.
Findings
effectively detects overfitting with probabilistic guarantees
guides data size decisions for validation and testing
demonstrates practical tractability and effectiveness
Abstract
Simplifying machine learning (ML) application development, including distributed computation, programming interface, resource management, model selection, etc, has attracted intensive interests recently. These research efforts have significantly improved the efficiency and the degree of automation of developing ML models. In this paper, we take a first step in an orthogonal direction towards automated quality management for human-in-the-loop ML application development. We build ease. ml/meter, a system that can automatically detect and measure the degree of overfitting during the whole lifecycle of ML application development. ease. ml/meter returns overfitting signals with strong probabilistic guarantees, based on which developers can take appropriate actions. In particular, ease. ml/meter provides principled guidelines to simple yet nontrivial questions regarding desired validation and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Autonomous Vehicle Technology and Safety · Data Stream Mining Techniques
