HD-CNN: Hierarchical Deep Convolutional Neural Network for Large Scale Visual Recognition
Zhicheng Yan, Hao Zhang, Robinson Piramuthu, Vignesh Jagadeesh, Dennis, DeCoste, Wei Di, Yizhou Yu

TL;DR
This paper introduces HD-CNN, a hierarchical deep convolutional neural network that leverages category hierarchies to improve large-scale visual recognition accuracy, outperforming flat CNNs on CIFAR100 and ImageNet datasets.
Contribution
The paper presents a novel hierarchical CNN architecture that incorporates category hierarchies, enabling more effective classification of difficult categories and scalable training methods.
Findings
Achieved state-of-the-art results on CIFAR100 and ImageNet datasets.
Lowered top-1 error rates by 1.1% to 3.1% compared to standard CNNs.
Demonstrated scalability with conditional execution and parameter compression.
Abstract
In image classification, visual separability between different object categories is highly uneven, and some categories are more difficult to distinguish than others. Such difficult categories demand more dedicated classifiers. However, existing deep convolutional neural networks (CNN) are trained as flat N-way classifiers, and few efforts have been made to leverage the hierarchical structure of categories. In this paper, we introduce hierarchical deep CNNs (HD-CNNs) by embedding deep CNNs into a category hierarchy. An HD-CNN separates easy classes using a coarse category classifier while distinguishing difficult classes using fine category classifiers. During HD-CNN training, component-wise pretraining is followed by global finetuning with a multinomial logistic loss regularized by a coarse category consistency term. In addition, conditional executions of fine category classifiers and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
