Maintaining AUC and $H$-measure over time
Nikolaj Tatti

TL;DR
This paper introduces efficient algorithms for maintaining AUC and H-measure in real-time as data points are added or removed, enabling faster performance monitoring of classifiers over time.
Contribution
It presents novel algorithms that efficiently update AUC and H-measure with dynamic data, improving over previous static computation methods.
Findings
Algorithms achieve O(log n) update time for AUC.
Convex hull maintenance enables efficient H-measure updates.
Empirical results show significant speed improvements over baselines.
Abstract
Measuring the performance of a classifier is a vital task in machine learning. The running time of an algorithm that computes the measure plays a very small role in an offline setting, for example, when the classifier is being developed by a researcher. However, the running time becomes more crucial if our goal is to monitor the performance of a classifier over time. In this paper we study three algorithms for maintaining two measures. The first algorithm maintains area under the ROC curve (AUC) under addition and deletion of data points in time. This is done by maintaining the data points sorted in a self-balanced search tree. In addition, we augment the search tree that allows us to query the ROC coordinates of a data point in time. In doing so we are able to maintain AUC in time. Our next two algorithms involve in maintaining -measure, an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Machine Learning and Algorithms · Machine Learning and Data Classification
