Using Wavelets to Analyze Similarities in Image-Classification Datasets
Roozbeh Yousefzadeh

TL;DR
This paper introduces a fast, wavelet-based method to analyze similarities in image classification datasets, providing insights into dataset structure and model generalization without requiring pre-trained models.
Contribution
The authors develop a novel, efficient approach using wavelet decomposition for dataset analysis that bypasses the need for pre-trained models, enabling rapid similarity detection.
Findings
Similar images can be identified in seconds
Insights into dataset structure can inform model generalization
Method corroborates previous CNN-based similarity analyses
Abstract
Deep learning image classifiers usually rely on huge training sets and their training process can be described as learning the similarities and differences among training images. But, images in large training sets are not usually studied from this perspective and fine-level similarities and differences among images is usually overlooked. This is due to lack of fast and efficient computational methods to analyze the contents of these datasets. Some studies aim to identify the influential and redundant training images, but such methods require a model that is already trained on the entire training set. Here, using image processing and numerical analysis tools we develop a practical and fast method to analyze the similarities in image classification datasets. We show that such analysis can provide valuable insights about the datasets and the classification task at hand, prior to training a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Advanced Image and Video Retrieval Techniques · Image Processing Techniques and Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
