Learning in Unlabeled Networks - An Active Learning and Inference Approach
Tomasz Kajdanowicz, Rados{\l}aw Michalski, Katarzyna Musia{\l},, Przemys{\l}aw Kazienko

TL;DR
This paper proposes and validates active learning methods for classifying nodes in unlabeled networks, emphasizing the importance of network structure and introducing measure-neighbour strategies to improve accuracy.
Contribution
It introduces new utility score formulations and measure-neighbour methods, demonstrating their effectiveness based on network clustering properties.
Findings
Measure-neighbour methods outperform regular methods in highly clustered networks.
The effectiveness of active learning methods depends on the network's clustering coefficient.
Different utility scores impact classification accuracy across various network structures.
Abstract
The task of determining labels of all network nodes based on the knowledge about network structure and labels of some training subset of nodes is called the within-network classification. It may happen that none of the labels of the nodes is known and additionally there is no information about number of classes to which nodes can be assigned. In such a case a subset of nodes has to be selected for initial label acquisition. The question that arises is: "labels of which nodes should be collected and used for learning in order to provide the best classification accuracy for the whole network?". Active learning and inference is a practical framework to study this problem. A set of methods for active learning and inference for within network classification is proposed and validated. The utility score calculation for each node based on network structure is the first step in the process.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
