DEEGITS: Deep Learning based Framework for Measuring Heterogenous Traffic State in Challenging Traffic Scenarios
Muttahirul Islam, Nazmul Haque, Md. Hadiuzzaman

TL;DR
DEEGITS is a deep learning framework that accurately detects and tracks vehicles and pedestrians in challenging traffic scenarios, enabling precise measurement of heterogeneous traffic states using CNNs, transfer learning, and multi-object tracking.
Contribution
This study introduces DEEGITS, a novel framework combining CNN-based detection, transfer learning, and multi-object tracking for measuring traffic states in complex scenarios.
Findings
Achieved 0.794 [email protected] detection accuracy.
Surpassed previous benchmarks on similar datasets.
Demonstrated high correlation in traffic measurement accuracy.
Abstract
This paper presents DEEGITS (Deep Learning Based Heterogeneous Traffic State Measurement), a comprehensive framework that leverages state-of-the-art convolutional neural network (CNN) techniques to accurately and rapidly detect vehicles and pedestrians, as well as to measure traffic states in challenging scenarios (i.e., congestion, occlusion). In this study, we enhance the training dataset through data fusion, enabling simultaneous detection of vehicles and pedestrians. Image preprocessing and augmentation are subsequently performed to improve the quality and quantity of the dataset. Transfer learning is applied on the YOLOv8 pretrained model to increase the model's capability to identify a diverse array of vehicles. Optimal hyperparameters are obtained using the Grid Search algorithm, with the Stochastic Gradient Descent (SGD) optimizer outperforming other optimizers under these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques
MethodsSparse Evolutionary Training · You Only Look Once · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
