Orderless Recurrent Models for Multi-label Classification

Vacit Oguz Yazici; Abel Gonzalez-Garcia; Arnau Ramisa; Bartlomiej; Twardowski; Joost van de Weijer

arXiv:1911.09996·cs.CV·March 13, 2020

Orderless Recurrent Models for Multi-label Classification

Vacit Oguz Yazici, Abel Gonzalez-Garcia, Arnau Ramisa, Bartlomiej, Twardowski, Joost van de Weijer

PDF

1 Repo 1 Video

TL;DR

This paper introduces a dynamic label ordering method for RNN-based multi-label classification that adapts to each image, leading to faster training and improved accuracy over traditional fixed ordering approaches.

Contribution

It proposes a novel dynamic label ordering technique for RNNs in multi-label classification, enhancing training efficiency and model performance.

Findings

01

Outperforms existing CNN-RNN models on MS-COCO, WIDER Attribute, and PA-100K datasets.

02

Avoids duplicate label generation common in other models.

03

Achieves state-of-the-art results with a standard encoder-decoder architecture.

Abstract

Recurrent neural networks (RNN) are popular for many computer vision tasks, including multi-label classification. Since RNNs produce sequential outputs, labels need to be ordered for the multi-label classification task. Current approaches sort labels according to their frequency, typically ordering them in either rare-first or frequent-first. These imposed orderings do not take into account that the natural order to generate the labels can change for each image, e.g.\ first the dominant object before summing up the smaller objects in the image. Therefore, in this paper, we propose ways to dynamically order the ground truth labels with the predicted label sequence. This allows for the faster training of more optimal LSTM models for multi-label classification. Analysis evidences that our method does not suffer from duplicate generation, something which is common for other models.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

voyazici/orderless-rnn-classification
pytorchOfficial

Videos

Orderless Recurrent Models for Multi-Label Classification· youtube

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory