Detection of Intoxicated Individuals from Facial Video Sequences via a Recurrent Fusion Model

Bita Baroutian; Atefe Aghaei; Mohsen Ebrahimi Moghaddam

arXiv:2512.04536·cs.CV·December 5, 2025

Detection of Intoxicated Individuals from Facial Video Sequences via a Recurrent Fusion Model

Bita Baroutian, Atefe Aghaei, Mohsen Ebrahimi Moghaddam

PDF

Open Access

TL;DR

This paper presents a novel video-based facial analysis method combining facial landmarks and spatiotemporal features to accurately detect alcohol intoxication, outperforming existing approaches and supporting public safety applications.

Contribution

Introduces a new fusion model integrating facial landmark analysis with 3D visual features for alcohol intoxication detection, along with a curated dataset for training and evaluation.

Findings

01

Achieves 95.82% accuracy in intoxication detection

02

Outperforms baseline models in precision and recall

03

Demonstrates potential for real-world public safety deployment

Abstract

Alcohol consumption is a significant public health concern and a major cause of accidents and fatalities worldwide. This study introduces a novel video-based facial sequence analysis approach dedicated to the detection of alcohol intoxication. The method integrates facial landmark analysis via a Graph Attention Network (GAT) with spatiotemporal visual features extracted using a 3D ResNet. These features are dynamically fused with adaptive prioritization to enhance classification performance. Additionally, we introduce a curated dataset comprising 3,542 video segments derived from 202 individuals to support training and evaluation. Our model is compared against two baselines: a custom 3D-CNN and a VGGFace+LSTM architecture. Experimental results show that our approach achieves 95.82% accuracy, 0.977 precision, and 0.97 recall, outperforming prior methods. The findings demonstrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmotion and Mood Recognition · Brain Tumor Detection and Classification · Face and Expression Recognition