Bi-Directional Multi-Scale Graph Dataset Condensation via Information   Bottleneck

Xingcheng Fu; Yisen Gao; Beining Yang; Yuxuan Wu; Haodong Qian,; Qingyun Sun; Xianxian Li

arXiv:2412.17355·cs.LG·December 24, 2024

Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck

Xingcheng Fu, Yisen Gao, Beining Yang, Yuxuan Wu, Haodong Qian,, Qingyun Sun, Xianxian Li

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a bi-directional multi-scale graph dataset condensation framework that preserves maximum information across scales, improving efficiency and stability in graph data compression for diverse on-device scenarios.

Contribution

It proposes a novel GNN-centric bi-directional condensation method based on mutual information theory and eigenbasis matching, unifying large-to-small and small-to-large scale paradigms.

Findings

01

Outperforms existing methods in graph condensation across multiple datasets.

02

Achieves stable and consistent multi-scale graph compression.

03

Effectively preserves original graph information at different scales.

Abstract

Dataset condensation has significantly improved model training efficiency, but its application on devices with different computing power brings new requirements for different data sizes. Thus, condensing multiple scale graphs simultaneously is the core of achieving efficient training in different on-device scenarios. Existing efficient works for multi-scale graph dataset condensation mainly perform efficient approximate computation in scale order (large-to-small or small-to-large scales). However, for non-Euclidean structures of sparse graph data, these two commonly used paradigms for multi-scale graph dataset condensation have serious scaling down degradation and scaling up collapse problems of a graph. The main bottleneck of the above paradigms is whether the effective information of the original graph is fully preserved when consenting to the primary sub-scale (the first of multiple…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ringbdstack/bimsgc
pytorchOfficial

Videos

Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck· underline

Taxonomy

TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Text and Document Classification Technologies