V-CAS: A Realtime Vehicle Anti Collision System Using Vision Transformer on Multi-Camera Streams

Muhammad Waqas Ashraf; Ali Hassan; Imad Ali Shah

arXiv:2411.01963·cs.RO·May 28, 2025

V-CAS: A Realtime Vehicle Anti Collision System Using Vision Transformer on Multi-Camera Streams

Muhammad Waqas Ashraf, Ali Hassan, Imad Ali Shah

PDF

Open Access

TL;DR

V-CAS is a real-time, multi-camera vision transformer-based system that improves vehicle collision avoidance by accurately perceiving the environment and proactively applying adaptive braking, demonstrating high accuracy and early alert capabilities.

Contribution

This work introduces V-CAS, a novel multi-camera, vision transformer-based system for real-time vehicle collision avoidance with enhanced perception and proactive safety features.

Findings

01

Achieved over 98% detection accuracy on the CCD dataset.

02

Provided an average proactive alert time of 1.13 seconds.

03

Enhanced collision avoidance performance over traditional single-camera systems.

Abstract

This paper introduces a real-time Vehicle Collision Avoidance System (V-CAS) designed to enhance vehicle safety through adaptive braking based on environmental perception. V-CAS leverages the advanced vision-based transformer model RT-DETR, DeepSORT tracking, speed estimation, brake light detection, and an adaptive braking mechanism. It computes a composite collision risk score based on vehicles' relative accelerations, distances, and detected braking actions, using brake light signals and trajectory data from multiple camera streams to improve scene perception. Implemented on the Jetson Orin Nano, V-CAS enables real-time collision risk assessment and proactive mitigation through adaptive braking. A comprehensive training process was conducted on various datasets for comparative analysis, followed by fine-tuning the selected object detection model using transfer learning. The system's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Autonomous Vehicle Technology and Safety · Anomaly Detection Techniques and Applications

MethodsAttention Is All You Need · Linear Layer · Softmax · Dense Connections · Layer Normalization · Residual Connection · Multi-Head Attention · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Vision Transformer