Elastic Interaction Energy-Informed Real-Time Traffic Scene Perception

Yaxin Feng; Yuan Lan; Luchan Zhang; Guoqing Liu; Yang Xiang

arXiv:2310.01449·cs.CV·April 4, 2024

Elastic Interaction Energy-Informed Real-Time Traffic Scene Perception

Yaxin Feng, Yuan Lan, Luchan Zhang, Guoqing Liu, Yang Xiang

PDF

Open Access

TL;DR

This paper introduces EIEGSeg, a novel training strategy using elastic interaction energy loss for real-time traffic scene segmentation, significantly improving accuracy on small and complex objects for autonomous driving.

Contribution

The paper presents a simple, efficient topology-aware energy loss function that enhances segmentation accuracy, especially for fine-scale structures in traffic scenes.

Findings

01

Improved segmentation accuracy on Cityscapes, TuSimple, and CULane datasets.

02

Enhanced detection of small and irregular objects.

03

Better performance on lightweight, real-time networks.

Abstract

Urban segmentation and lane detection are two important tasks for traffic scene perception. Accuracy and fast inference speed of visual perception are crucial for autonomous driving safety. Fine and complex geometric objects are the most challenging but important recognition targets in traffic scene, such as pedestrians, traffic signs and lanes. In this paper, a simple and efficient topology-aware energy loss function-based network training strategy named EIEGSeg is proposed. EIEGSeg is designed for multi-class segmentation on real-time traffic scene perception. To be specific, the convolutional neural network (CNN) extracts image features and produces multiple outputs, and the elastic interaction energy loss function (EIEL) drives the predictions moving toward the ground truth until they are completely overlapped. Our strategy performs well especially on fine-scale structure,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Video Surveillance and Tracking Methods

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings