Understanding Visual Feature Reliance through the Lens of Complexity
Thomas Fel, Louis Bethune, Andrew Kyle Lampinen, Thomas Serre,, Katherine Hermann

TL;DR
This paper introduces a new metric based on $\
Contribution
It proposes a $\\mathscr{V}$-information based metric to quantify feature complexity and analyzes how features of varying complexity are learned and utilized in vision models.
Findings
Simpler features dominate early training stages.
Complex features are less important for decisions.
Important features become accessible at earlier layers.
Abstract
Recent studies suggest that deep learning models inductive bias towards favoring simpler features may be one of the sources of shortcut learning. Yet, there has been limited focus on understanding the complexity of the myriad features that models learn. In this work, we introduce a new metric for quantifying feature complexity, based on -information and capturing whether a feature requires complex computational transformations to be extracted. Using this -information metric, we analyze the complexities of 10,000 features, represented as directions in the penultimate layer, that were extracted from a standard ImageNet-trained vision model. Our study addresses four key questions: First, we ask what features look like as a function of complexity and find a spectrum of simple to complex features present within the model. Second, we ask when features are learned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Cell Image Analysis Techniques
MethodsFocus
