PQ-DAF: Pose-driven Quality-controlled Data Augmentation for Data-scarce Driver Distraction Detection
Haibin Sun, Xinghui Song

TL;DR
This paper introduces PQ-DAF, a novel data augmentation framework that uses pose-driven synthesis and quality filtering to improve driver distraction detection in data-scarce scenarios, enhancing model robustness across domains.
Contribution
The paper proposes a new pose-driven data augmentation method with quality control, leveraging vision-language models to generate and filter synthetic training data for better generalization.
Findings
Significant performance improvements in few-shot driver distraction detection.
Enhanced cross-domain robustness of the detection models.
Effective filtering of synthetic samples improves dataset reliability.
Abstract
Driver distraction detection is essential for improving traffic safety and reducing road accidents. However, existing models often suffer from degraded generalization when deployed in real-world scenarios. This limitation primarily arises from the few-shot learning challenge caused by the high cost of data annotation in practical environments, as well as the substantial domain shift between training datasets and target deployment conditions. To address these issues, we propose a Pose-driven Quality-controlled Data Augmentation Framework (PQ-DAF) that leverages a vision-language model for sample filtering to cost-effectively expand training data and enhance cross-domain robustness. Specifically, we employ a Progressive Conditional Diffusion Model (PCDMs) to accurately capture key driver pose features and synthesize diverse training examples. A sample quality assessment module, built upon…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
