Outlier Detection In Large-scale Traffic Data By Na\"ive Bayes Method and Gaussian Mixture Model Method
Philip Lam, Lili Wang, Henry Y.T. Ngan, Nelson H.C. Yung, Anthony G.O., Yeh

TL;DR
This paper introduces Kernel Smoothing Naive Bayes and Gaussian Mixture Model methods for automatic outlier detection in large-scale traffic data, demonstrating high accuracy in identifying hardware errors and abnormal events.
Contribution
It presents two novel methods tailored for traffic outlier detection, including their modeling and performance evaluation on real traffic data.
Findings
Naive Bayes with Triangle kernel achieves 93.78% accuracy.
GMM method achieves 94.50% accuracy.
Both methods effectively detect outliers in traffic data.
Abstract
It is meaningful to detect outliers in traffic data for traffic management. However, this is a massive task for people from large-scale database to distinguish outliers. In this paper, we present two methods: Kernel Smoothing Na\"ive Bayes (NB) method and Gaussian Mixture Model (GMM) method to automatically detect any hardware errors as well as abnormal traffic events in traffic data collected at a four-arm junction in Hong Kong. Traffic data was recorded in a video format, and converted to spatial-temporal (ST) traffic signals by statistics. The ST signals are then projected to a two-dimensional (2D) (x,y)-coordinate plane by Principal Component Analysis (PCA) for dimension reduction. We assume that inlier data are normal distributed. As such, the NB and GMM methods are successfully applied in outlier detection (OD) for traffic data. The kernel smooth NB method assumes the existence of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Advanced Statistical Methods and Models · Network Security and Intrusion Detection
