CNN2GNN: How to Bridge CNN with GNN

Ziheng Jiao; Hongyuan Zhang; Xuelong Li

arXiv:2404.14822·cs.CV·April 24, 2024·1 cites

CNN2GNN: How to Bridge CNN with GNN

Ziheng Jiao, Hongyuan Zhang, Xuelong Li

PDF

Open Access

TL;DR

This paper introduces CNN2GNN, a framework that combines CNN and GNN via distillation, enabling efficient learning of intra-sample features and dataset topology, leading to improved performance on vision tasks.

Contribution

The paper proposes a novel CNN2GNN framework with a differentiable graph learning module and response-based distillation to unify CNN and GNN for enhanced vision task performance.

Findings

01

Distilled GNN outperforms deep CNNs like ResNet152 on Mini-ImageNet.

02

Dynamic graph learning improves GNN's ability to model data relationships.

03

The approach reduces training costs while boosting accuracy.

Abstract

Although the convolutional neural network (CNN) has achieved excellent performance in vision tasks by extracting the intra-sample representation, it will take a higher training expense because of stacking numerous convolutional layers. Recently, as the bilinear models, graph neural networks (GNN) have succeeded in exploring the underlying topological relationship among the graph data with a few graph neural layers. Unfortunately, it cannot be directly utilized on non-graph data due to the lack of graph structure and has high inference latency on large-scale scenarios. Inspired by these complementary strengths and weaknesses, \textit{we discuss a natural question, how to bridge these two heterogeneous networks?} In this paper, we propose a novel CNN2GNN framework to unify CNN and GNN together via distillation. Firstly, to break the limitations of GNN, a differentiable sparse graph…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Face recognition and analysis