On the Generalizability of Iterative Patch Selection for Memory-Efficient High-Resolution Image Classification
Max Riffi-Aslett, Christina Fell

TL;DR
This paper investigates the generalizability of iterative patch selection in memory-efficient high-resolution image classification, revealing how dataset size, patch size, and noise affect performance, and proposing improvements for low-data scenarios.
Contribution
It introduces a novel testbed extending megapixel MNIST to study patch selection effects, and analyzes how dataset size, patch size, and noise influence generalization in IPS-based models.
Findings
Lower O2I ratios hinder classifier generalization, especially with small datasets.
Tuning patch size smaller than ROI improves generalization in low-data settings.
Noise similarity to image content causes IPS to fail, affecting robustness.
Abstract
Classifying large images with small or tiny regions of interest (ROI) is challenging due to computational and memory constraints. Weakly supervised memory-efficient patch selectors have achieved results comparable with strongly supervised methods. However, low signal-to-noise ratios and low entropy attention still cause overfitting. We explore these issues using a novel testbed on a memory-efficient cross-attention transformer with Iterative Patch Selection (IPS) as the patch selection module. Our testbed extends the megapixel MNIST benchmark to four smaller O2I (object-to-image) ratios ranging from 0.01% to 0.14% while keeping the canvas size fixed and introducing a noise generation component based on B\'ezier curves. Experimental results generalize the observations made on CNNs to IPS whereby the O2I threshold below which the classifier fails to generalize is affected by the training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Medical Image Segmentation Techniques
MethodsSoftmax · Attention Is All You Need
