Towards Reliable Rare Category Analysis on Graphs via Individual Calibration
Longfeng Wu, Bowen Lei, Dongkuan Xu, Dawei Zhou

TL;DR
This paper introduces CALIRARE, a novel individual calibration framework for rare category analysis on graphs, addressing uncertainty quantification and miscalibration issues to improve reliability in high-stakes applications.
Contribution
The paper proposes a new individual calibration method, EICE, and a node-level uncertainty quantification algorithm to enhance the reliability of rare category analysis on graphs.
Findings
State-of-the-art RCA methods are over-confident on minority classes.
CALIRARE effectively calibrates predictions and quantifies uncertainties.
Experimental results demonstrate improved reliability in real-world datasets.
Abstract
Rare categories abound in a number of real-world networks and play a pivotal role in a variety of high-stakes applications, including financial fraud detection, network intrusion detection, and rare disease diagnosis. Rare category analysis (RCA) refers to the task of detecting, characterizing, and comprehending the behaviors of minority classes in a highly-imbalanced data distribution. While the vast majority of existing work on RCA has focused on improving the prediction performance, a few fundamental research questions heretofore have received little attention and are less explored: How confident or uncertain is a prediction model in rare category analysis? How can we quantify the uncertainty in the learning process and enable reliable rare category analysis? To answer these questions, we start by investigating miscalibration in existing RCA methods. Empirical results reveal that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Advanced Graph Neural Networks · Imbalanced Data Classification Techniques
