Visual Recognition with Deep Nearest Centroids
Wenguan Wang, Cheng Han, Tianfei Zhou, Dongfang Liu

TL;DR
This paper introduces Deep Nearest Centroids (DNC), a nonparametric, explainable network for large-scale visual recognition that leverages class sub-centroids for classification, improving transparency and transferability.
Contribution
DNC revisits the classic Nearest Centroids classifier, integrating it into deep networks for improved explainability, transferability, and performance in image and pixel recognition tasks.
Findings
DNC outperforms parametric models on CIFAR-10 and ImageNet.
DNC enhances pixel recognition accuracy on ADE20K and Cityscapes.
DNC offers ad-hoc explainability by using actual training images as sub-centroids.
Abstract
We devise deep nearest centroids (DNC), a conceptually elegant yet surprisingly effective network for large-scale visual recognition, by revisiting Nearest Centroids, one of the most classic and simple classifiers. Current deep models learn the classifier in a fully parametric manner, ignoring the latent data structure and lacking simplicity and explainability. DNC instead conducts nonparametric, case-based reasoning; it utilizes sub-centroids of training samples to describe class distributions and clearly explains the classification as the proximity of test data and the class sub-centroids in the feature space. Due to the distance-based nature, the network output dimensionality is flexible, and all the learnable parameters are only for data embedding. That means all the knowledge learnt for ImageNet classification can be completely transferred for pixel recognition learning, under the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsRemote-Sensing Image Classification · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications
MethodsTest
