Elastic Interaction Energy-Informed Real-Time Traffic Scene Perception
Yaxin Feng, Yuan Lan, Luchan Zhang, Guoqing Liu, Yang Xiang

TL;DR
This paper introduces EIEGSeg, a novel training strategy using elastic interaction energy loss for real-time traffic scene segmentation, significantly improving accuracy on small and complex objects for autonomous driving.
Contribution
The paper presents a simple, efficient topology-aware energy loss function that enhances segmentation accuracy, especially for fine-scale structures in traffic scenes.
Findings
Improved segmentation accuracy on Cityscapes, TuSimple, and CULane datasets.
Enhanced detection of small and irregular objects.
Better performance on lightweight, real-time networks.
Abstract
Urban segmentation and lane detection are two important tasks for traffic scene perception. Accuracy and fast inference speed of visual perception are crucial for autonomous driving safety. Fine and complex geometric objects are the most challenging but important recognition targets in traffic scene, such as pedestrians, traffic signs and lanes. In this paper, a simple and efficient topology-aware energy loss function-based network training strategy named EIEGSeg is proposed. EIEGSeg is designed for multi-class segmentation on real-time traffic scene perception. To be specific, the convolutional neural network (CNN) extracts image features and produces multiple outputs, and the elastic interaction energy loss function (EIEL) drives the predictions moving toward the ground truth until they are completely overlapped. Our strategy performs well especially on fine-scale structure,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Video Surveillance and Tracking Methods
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
