Combined convolutional and recurrent neural networks for hierarchical   classification of images

Jaehoon Koo; Diego Klabjan; Jean Utke

arXiv:1809.09574·cs.LG·November 19, 2019

Combined convolutional and recurrent neural networks for hierarchical classification of images

Jaehoon Koo, Diego Klabjan, Jean Utke

PDF

TL;DR

This paper introduces a hierarchical classification model combining CNNs and RNNs to better exploit class hierarchies in image classification, outperforming standard CNNs on real-world datasets.

Contribution

It presents a novel hybrid CNN-RNN architecture with residual learning for hierarchical image classification, improving over existing flat classifiers.

Findings

01

Hierarchical models outperform flat CNN classifiers.

02

Residual learning enhances training and generalization.

03

Model achieves better accuracy on proprietary dataset.

Abstract

Deep learning models based on CNNs are predominantly used in image classification tasks. Such approaches, assuming independence of object categories, normally use a CNN as a feature learner and apply a flat classifier on top of it. Object classes in many settings have hierarchical relations, and classifiers exploiting these relations should perform better. We propose hierarchical classification models combining a CNN to extract hierarchical representations of images, and an RNN or sequence-to-sequence model to capture a hierarchical tree of classes. In addition, we apply residual learning to the RNN part in oder to facilitate training our compound model and improve generalization of the model. Experimental results on a real world proprietary dataset of images show that our hierarchical networks perform better than state-of-the-art CNNs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.