Sequential Random Network for Fine-grained Image Classification

Chaorong Li; Malu Zhang; Wei Huang; Fengqing Qin; Anping Zeng,; Yuanyuan Huang

arXiv:2103.07230·cs.CV·June 8, 2021·1 cites

Sequential Random Network for Fine-grained Image Classification

Chaorong Li, Malu Zhang, Wei Huang, Fengqing Qin, Anping Zeng,, Yuanyuan Huang

PDF

Open Access

TL;DR

This paper introduces a Sequence Random Network (SRN) that enhances deep learning models for fine-grained image classification by emphasizing detailed features, significantly improving accuracy over existing methods.

Contribution

The paper proposes a novel SRN architecture combining BiLSTM and Tanh-Dropout blocks to better capture image details in fine-grained classification tasks.

Findings

01

Achieved over 99% accuracy on five fine-grained datasets

02

Outperformed existing state-of-the-art methods

03

Applicable to models beyond DCNN, including Transformers

Abstract

Deep Convolutional Neural Network (DCNN) and Transformer have achieved remarkable successes in image recognition. However, their performance in fine-grained image recognition is still difficult to meet the requirements of actual needs. This paper proposes a Sequence Random Network (SRN) to enhance the performance of DCNN. The output of DCNN is one-dimensional features. This one-dimensional feature abstractly represents image information, but it does not express well the detailed information of image. To address this issue, we use the proposed SRN which composed of BiLSTM and several Tanh-Dropout blocks (called BiLSTM-TDN), to further process DCNN one-dimensional features for highlighting the detail information of image. After the feature transform by BiLSTM-TDN, the recognition performance has been greatly improved. We conducted the experiments on six fine-grained image datasets. Except…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Neural Networks and Applications · Advanced Image and Video Retrieval Techniques

MethodsLinear Layer · Diffusion-Convolutional Neural Networks · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Dropout · Attention Is All You Need · Label Smoothing · Adam · Residual Connection