PQ-DAF: Pose-driven Quality-controlled Data Augmentation for Data-scarce Driver Distraction Detection

Haibin Sun; Xinghui Song

arXiv:2508.10397·cs.CV·August 15, 2025

PQ-DAF: Pose-driven Quality-controlled Data Augmentation for Data-scarce Driver Distraction Detection

Haibin Sun, Xinghui Song

PDF

TL;DR

This paper introduces PQ-DAF, a novel data augmentation framework that uses pose-driven synthesis and quality filtering to improve driver distraction detection in data-scarce scenarios, enhancing model robustness across domains.

Contribution

The paper proposes a new pose-driven data augmentation method with quality control, leveraging vision-language models to generate and filter synthetic training data for better generalization.

Findings

01

Significant performance improvements in few-shot driver distraction detection.

02

Enhanced cross-domain robustness of the detection models.

03

Effective filtering of synthetic samples improves dataset reliability.

Abstract

Driver distraction detection is essential for improving traffic safety and reducing road accidents. However, existing models often suffer from degraded generalization when deployed in real-world scenarios. This limitation primarily arises from the few-shot learning challenge caused by the high cost of data annotation in practical environments, as well as the substantial domain shift between training datasets and target deployment conditions. To address these issues, we propose a Pose-driven Quality-controlled Data Augmentation Framework (PQ-DAF) that leverages a vision-language model for sample filtering to cost-effectively expand training data and enhance cross-domain robustness. Specifically, we employ a Progressive Conditional Diffusion Model (PCDMs) to accurately capture key driver pose features and synthesize diverse training examples. A sample quality assessment module, built upon…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.