V-CAS: A Realtime Vehicle Anti Collision System Using Vision Transformer on Multi-Camera Streams
Muhammad Waqas Ashraf, Ali Hassan, Imad Ali Shah

TL;DR
V-CAS is a real-time, multi-camera vision transformer-based system that improves vehicle collision avoidance by accurately perceiving the environment and proactively applying adaptive braking, demonstrating high accuracy and early alert capabilities.
Contribution
This work introduces V-CAS, a novel multi-camera, vision transformer-based system for real-time vehicle collision avoidance with enhanced perception and proactive safety features.
Findings
Achieved over 98% detection accuracy on the CCD dataset.
Provided an average proactive alert time of 1.13 seconds.
Enhanced collision avoidance performance over traditional single-camera systems.
Abstract
This paper introduces a real-time Vehicle Collision Avoidance System (V-CAS) designed to enhance vehicle safety through adaptive braking based on environmental perception. V-CAS leverages the advanced vision-based transformer model RT-DETR, DeepSORT tracking, speed estimation, brake light detection, and an adaptive braking mechanism. It computes a composite collision risk score based on vehicles' relative accelerations, distances, and detected braking actions, using brake light signals and trajectory data from multiple camera streams to improve scene perception. Implemented on the Jetson Orin Nano, V-CAS enables real-time collision risk assessment and proactive mitigation through adaptive braking. A comprehensive training process was conducted on various datasets for comparative analysis, followed by fine-tuning the selected object detection model using transfer learning. The system's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Autonomous Vehicle Technology and Safety · Anomaly Detection Techniques and Applications
MethodsAttention Is All You Need · Linear Layer · Softmax · Dense Connections · Layer Normalization · Residual Connection · Multi-Head Attention · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Vision Transformer
