Efficient Visual Fault Detection for Freight Train Braking System via Heterogeneous Self Distillation in the Wild
Yang Zhang, Huilin Pan, Yang Zhou, Mingying Li, Guodong Sun

TL;DR
This paper introduces a lightweight heterogeneous self-distillation framework for efficient visual fault detection in freight trains, achieving high accuracy and speed with low resource consumption suitable for real-world railway safety applications.
Contribution
The paper presents a novel heterogeneous self-distillation approach with a lightweight backbone and a new knowledge neck, improving detection accuracy and efficiency in resource-constrained environments.
Findings
Achieves over 37 frames per second in fault detection.
Maintains highest accuracy compared to traditional distillation methods.
Uses less memory and has the smallest model size among competitors.
Abstract
Efficient visual fault detection of freight trains is a critical part of ensuring the safe operation of railways under the restricted hardware environment. Although deep learning-based approaches have excelled in object detection, the efficiency of freight train fault detection is still insufficient to apply in real-world engineering. This paper proposes a heterogeneous self-distillation framework to ensure detection accuracy and speed while satisfying low resource requirements. The privileged information in the output feature knowledge can be transferred from the teacher to the student model through distillation to boost performance. We first adopt a lightweight backbone to extract features and generate a new heterogeneous knowledge neck. Such neck models positional information and long-range dependencies among channels through parallel encoding to optimize feature extraction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrastructure Maintenance and Monitoring · Hand Gesture Recognition Systems · Industrial Vision Systems and Defect Detection
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
