Revisiting Generalization Power of a DNN in Terms of Symbolic   Interactions

Lei Cheng; Junpeng Zhang; Qihan Ren; Quanshi Zhang

arXiv:2502.10162·cs.LG·February 17, 2025

Revisiting Generalization Power of a DNN in Terms of Symbolic Interactions

Lei Cheng, Junpeng Zhang, Qihan Ren, Quanshi Zhang

PDF

Open Access

TL;DR

This paper investigates the generalization capabilities of deep neural networks by analyzing the role of interactions, revealing distinct distribution patterns for generalizable versus non-generalizable interactions and providing a method to disentangle them.

Contribution

It introduces a novel interaction-based perspective on DNN generalization, with a theory that distinguishes between types of interactions and matches real network behaviors.

Findings

01

Generalizable interactions follow a decay-shaped distribution.

02

Non-generalizable interactions follow a spindle-shaped distribution.

03

The theory effectively disentangles interaction types in DNNs.

Abstract

This paper aims to analyze the generalization power of deep neural networks (DNNs) from the perspective of interactions. Unlike previous analysis of a DNN's generalization power in a highdimensional feature space, we find that the generalization power of a DNN can be explained as the generalization power of the interactions. We found that the generalizable interactions follow a decay-shaped distribution, while non-generalizable interactions follow a spindle-shaped distribution. Furthermore, our theory can effectively disentangle these two types of interactions from a DNN. We have verified that our theory can well match real interactions in a DNN in experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications