Combined convolutional and recurrent neural networks for hierarchical classification of images
Jaehoon Koo, Diego Klabjan, Jean Utke

TL;DR
This paper introduces a hierarchical classification model combining CNNs and RNNs to better exploit class hierarchies in image classification, outperforming standard CNNs on real-world datasets.
Contribution
It presents a novel hybrid CNN-RNN architecture with residual learning for hierarchical image classification, improving over existing flat classifiers.
Findings
Hierarchical models outperform flat CNN classifiers.
Residual learning enhances training and generalization.
Model achieves better accuracy on proprietary dataset.
Abstract
Deep learning models based on CNNs are predominantly used in image classification tasks. Such approaches, assuming independence of object categories, normally use a CNN as a feature learner and apply a flat classifier on top of it. Object classes in many settings have hierarchical relations, and classifiers exploiting these relations should perform better. We propose hierarchical classification models combining a CNN to extract hierarchical representations of images, and an RNN or sequence-to-sequence model to capture a hierarchical tree of classes. In addition, we apply residual learning to the RNN part in oder to facilitate training our compound model and improve generalization of the model. Experimental results on a real world proprietary dataset of images show that our hierarchical networks perform better than state-of-the-art CNNs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
