Revisiting Sparse Convolutional Model for Visual Recognition

Xili Dai; Mingyang Li; Pengyuan Zhai; Shengbang Tong; Xingjian Gao; Shao-Lun Huang; Zhihui Zhu; Chong You; Yi Ma

arXiv:2210.12945 · cs.CV · October 25, 2022 · 21 cites

Open Access · 1 Repo · 1 Video

TL;DR

This paper revisits sparse convolutional models for image classification, integrating them into deep networks to combine interpretability with competitive empirical performance and robustness to input perturbations.

Contribution

It introduces differentiable optimization layers based on convolutional sparse coding as replacements for standard convolutional layers in deep neural networks.
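To make the idea of a layer defined by an optimization problem concrete, here is a minimal 1-D sketch of convolutional sparse coding solved with ISTA (iterative soft-thresholding), a standard solver for l1-regularized least squares. Everything in it is an assumption for illustration: the function names (`synthesize`, `correlate`, `csc_ista`), the 1-D setting, the choice of ISTA, and the values of `lam` and `n_iter` are not taken from the paper or its repository. It covers only the forward computation of the sparse code. A differentiable layer would additionally backpropagate through these iterations, which an autodiff framework would handle.

```python
def synthesize(filters, codes):
    """Sum of full 1-D convolutions: x_hat = sum_k d_k * z_k."""
    m, p = len(filters[0]), len(codes[0])
    out = [0.0] * (p + m - 1)
    for d, z in zip(filters, codes):
        for i, zi in enumerate(z):
            if zi != 0.0:
                for j, dj in enumerate(d):
                    out[i + j] += dj * zi
    return out


def correlate(filters, residual):
    """Adjoint of synthesize: one correlation map per filter."""
    m = len(filters[0])
    p = len(residual) - m + 1
    return [[sum(d[j] * residual[i + j] for j in range(m)) for i in range(p)]
            for d in filters]


def soft_threshold(v, t):
    """Proximal operator of t * |v|: shrink v toward zero by t."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def objective(filters, codes, x, lam):
    """0.5 * ||synthesize(codes) - x||^2 + lam * ||codes||_1."""
    r = [a - b for a, b in zip(synthesize(filters, codes), x)]
    l1 = sum(abs(v) for z in codes for v in z)
    return 0.5 * sum(v * v for v in r) + lam * l1


def csc_ista(x, filters, lam=0.1, n_iter=100):
    """Approximate the sparse codes of x with ISTA (hypothetical helper)."""
    m = len(filters[0])
    p = len(x) - m + 1
    # Safe step size: sum_k ||d_k||_1^2 upper-bounds the squared operator norm.
    step = 1.0 / sum(sum(abs(v) for v in d) ** 2 for d in filters)
    codes = [[0.0] * p for _ in filters]
    for _ in range(n_iter):
        residual = [a - b for a, b in zip(synthesize(filters, codes), x)]
        grads = correlate(filters, residual)
        codes = [[soft_threshold(z - step * g, step * lam)
                  for z, g in zip(zk, gk)]
                 for zk, gk in zip(codes, grads)]
    return codes


# Usage: build a signal from two filters and a sparse code, then re-encode it.
filters = [[1.0, 0.0, -1.0], [0.5, 1.0, 0.5]]
true_codes = [[0.0] * 8 for _ in filters]
true_codes[0][1] = 2.0
true_codes[1][5] = -1.5
x = synthesize(filters, true_codes)

codes = csc_ista(x, filters, lam=0.05, n_iter=200)
zero_codes = [[0.0] * 8 for _ in filters]
# The second value should be smaller: ISTA with this step never increases the objective.
print(objective(filters, zero_codes, x, 0.05), objective(filters, codes, x, 0.05))
```

In a network, the returned codes would play the role of the feature maps that a standard convolutional layer outputs, with the filters acting as the learned dictionary.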

Findings

01. Achieves accuracy comparable to standard deep networks on CIFAR-10, CIFAR-100, and ImageNet.

02. Demonstrates increased robustness to input corruptions and adversarial attacks.

03. Bridges the gap between the interpretability of sparse models and the empirical performance of deep learning.

Abstract

Despite strong empirical performance for image classification, deep neural networks are often regarded as "black boxes" and are difficult to interpret. On the other hand, sparse convolutional models, which assume that a signal can be expressed by a linear combination of a few elements from a convolutional dictionary, are powerful tools for analyzing natural images with good theoretical interpretability and biological plausibility. However, such principled models have not demonstrated competitive performance when compared with empirically designed deep networks. This paper revisits sparse convolutional modeling for image classification and bridges the gap between good empirical performance (of deep learning) and good interpretability (of sparse convolutional models). Our method uses differentiable optimization layers that are defined from convolutional sparse coding as drop-in…
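The sparse convolutional model described in the abstract is commonly formalized as below. The notation is an assumption made for illustration and is not taken from the text above: $x$ is the signal, $d_1, \dots, d_K$ are the dictionary filters, $z_1, \dots, z_K$ are the sparse code maps, $*$ is convolution, and $\lambda > 0$ trades reconstruction error against sparsity. The paper's exact formulation may differ.

```latex
\begin{align}
  x &\approx \sum_{k=1}^{K} d_k * z_k,
     \qquad \text{with most entries of each } z_k \text{ equal to zero}, \\
  \hat{z} &= \arg\min_{z_1, \dots, z_K}
     \frac{1}{2} \Big\| x - \sum_{k=1}^{K} d_k * z_k \Big\|_2^2
     + \lambda \sum_{k=1}^{K} \| z_k \|_1 .
\end{align}
```

The first line is the signal model, a linear combination of a few dictionary elements. The second is the associated sparse coding problem, whose solution $\hat{z}$ serves as the representation of $x$.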

Code & Models

Repositories

delay-xili/sdnet
PyTorch · Official

Videos

Revisiting Sparse Convolutional Model for Visual Recognition · SlidesLive

Taxonomy

Topics

Adversarial Robustness in Machine Learning · Cell Image Analysis Techniques · Domain Adaptation and Few-Shot Learning