Automatic Rail Component Detection Based on AttnConv-Net

Tiange Wang; Zijun Zhang; Fangfang Yang; and Kwok-Leung Tsui

arXiv:2108.02423·cs.CV·February 14, 2022

Automatic Rail Component Detection Based on AttnConv-Net

Tiange Wang, Zijun Zhang, Fangfang Yang, and Kwok-Leung Tsui

PDF

TL;DR

This paper introduces AttnConv-Net, an attention-based deep learning model that improves the detection of rail components in images, offering faster and more accurate results without complex pre- or post-processing.

Contribution

The paper presents a novel attention-powered deep convolutional network with cascading attention blocks for rail component detection, simplifying the detection pipeline and enhancing small object detection.

Findings

01

Outperforms classic CNN methods in accuracy and speed

02

Effectively detects multiple rail components including small parts

03

Validated on real and synthetic datasets

Abstract

The automatic detection of major rail components using railway images is beneficial to ensure the rail transport safety. In this paper, we propose an attention-powered deep convolutional network (AttnConv-net) to detect multiple rail components including the rail, clips, and bolts. The proposed method consists of a deep convolutional neural network (DCNN) as the backbone, cascading attention blocks (CAB), and two feed forward networks (FFN). Two types of positional embedding are applied to enrich information in latent features extracted from the backbone. Based on processed latent features, the CAB aims to learn the local context of rail components including their categories and component boundaries. Final categories and bounding boxes are generated via two FFN implemented in parallel. To enhance the detection of small components, various data augmentation methods are employed in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.