Relevance-driven Input Dropout: an Explanation-guided Regularization Technique
Shreyas Gururaj, Lars Gr\"une, Wojciech Samek, Sebastian Lapuschkin, Leander Weber

TL;DR
Relevance-driven Input Dropout (RelDrop) is a new regularization technique that selectively occludes important input regions during training, encouraging models to utilize diverse features and thereby enhancing generalization and robustness.
Contribution
The paper introduces RelDrop, a novel relevance-guided data augmentation method that improves model generalization by selectively occluding key input regions based on their relevance.
Findings
RelDrop enhances robustness to occlusion.
Models trained with RelDrop utilize more diverse features.
RelDrop improves inference time generalization.
Abstract
Overfitting is a well-known issue extending even to state-of-the-art (SOTA) Machine Learning (ML) models, resulting in reduced generalization, and a significant train-test performance gap. Mitigation measures include a combination of dropout, data augmentation, weight decay, and other regularization techniques. Among the various data augmentation strategies, occlusion is a prominent technique that typically focuses on randomly masking regions of the input during training. Most of the existing literature emphasizes randomness in selecting and modifying the input features instead of regions that strongly influence model decisions. We propose Relevance-driven Input Dropout (RelDrop), a novel data augmentation method which selectively occludes the most relevant regions of the input, nudging the model to use other important features in the prediction process, thus improving model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
