Explaining Model Overfitting in CNNs via GMM Clustering

Hui Dou; Xinyu Mu; Mengjun Yi; Feng Han; Jian Zhao; Furao Shen

arXiv:2412.10457·cs.LG·December 17, 2024

Explaining Model Overfitting in CNNs via GMM Clustering

Hui Dou, Xinyu Mu, Mengjun Yi, Feng Han, Jian Zhao, Furao Shen

PDF

Open Access

TL;DR

This paper introduces a GMM-based clustering method to analyze CNN filters, providing insights into overfitting by identifying anomaly filters and offering a new perspective on model evaluation.

Contribution

It proposes a universal GMM clustering approach to assess CNN filters and analyze their relation to overfitting, applicable across various architectures without modifications.

Findings

01

GMM clustering effectively identifies anomaly filters.

02

Analysis reveals links between anomaly filters and overfitting.

03

Method applicable to models like AlexNet and LeNet-5.

Abstract

Convolutional Neural Networks (CNNs) have demonstrated remarkable prowess in the field of computer vision. However, their opaque decision-making processes pose significant challenges for practical applications. In this study, we provide quantitative metrics for assessing CNN filters by clustering the feature maps corresponding to individual filters in the model via Gaussian Mixture Model (GMM). By analyzing the clustering results, we screen out some anomaly filters associated with outlier samples. We further analyze the relationship between the anomaly filters and model overfitting, proposing three hypotheses. This method is universally applicable across diverse CNN architectures without modifications, as evidenced by its successful application to models like AlexNet and LeNet-5. We present three meticulously designed experiments demonstrating our hypotheses from the perspectives of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Statistical and Computational Modeling · Explainable Artificial Intelligence (XAI)