Making AI Drafts Count: A Quality Threshold in Audio Description Workflows
Lana Do, Shasta Ihorn, Charity M. Pitcher-Cooper, Sanjay Mirani, Gio Jung, Hyunjoo Shim, Zhenzhen Qin, Kien T. Nguyen, Vassilis Athitsos, and Ilmi Yoon

TL;DR
This paper explores how the quality of AI-generated drafts impacts the efficiency and effectiveness of audio description editing, emphasizing a minimum quality threshold for useful AI assistance.
Contribution
It introduces GenAD and RefineAD, demonstrating that high-quality AI drafts significantly improve audio description workflows and establish a content-dependent quality threshold.
Findings
GenAD drafts reduce completion time by over 50%
Higher-quality AI drafts lower cognitive load during editing
Content complexity influences the required AI draft quality
Abstract
Audio description (AD) narrates visual elements in video for blind and low-vision audiences. Recent work has shown that giving novice describers an AI-generated draft to start from helps produce higher-quality AD and lowers the barrier to entry. What remains an open question is how draft quality shapes the editing process. We investigate this through GenAD, an AD generation pipeline that incorporates accessibility guidelines and contextual video information, and RefineAD, an editing interface for human revisions. Human-AI contributions are measured across text, timing, and delivery. In a within-subjects study, we compared authoring from scratch against editing AI drafts of varying quality. GenAD drafts cut completion time by more than half and significantly reduced cognitive load. In contrast, baseline drafts generated from simple, unguided prompts offered only modest benefits, pointing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
