Effects of Generative AI Errors on User Reliance Across Task Difficulty

Jacy Reese Anthis; Hannah Cha; Solon Barocas; Alexandra Chouldechova; Jake Hofman

arXiv:2604.04319·cs.CY·April 7, 2026

Effects of Generative AI Errors on User Reliance Across Task Difficulty

Jacy Reese Anthis, Hannah Cha, Solon Barocas, Alexandra Chouldechova, Jake Hofman

PDF

TL;DR

This study examines how errors in generative AI influence user reliance, revealing that error rate and task difficulty impact reliance differently than expected, with implications for AI design.

Contribution

Developed an experimental methodology to test effects of AI errors on user reliance across varying task difficulties, revealing unexpected resilience to easy-task errors.

Findings

01

Higher error rates decrease user reliance.

02

Easy-task errors did not significantly reduce reliance more than hard-task errors.

03

Users are not strongly averse to jagged error patterns in this setting.

Abstract

The capabilities of artificial intelligence (AI) lie along a jagged frontier, where AI systems surprisingly fail on tasks that humans find easy and succeed on tasks that humans find hard. To investigate user reactions to this phenomenon, we developed an incentive-compatible experimental methodology based on diagram generation tasks, in which we induce errors in generative AI output and test effects on user reliance. We demonstrate the interface in a preregistered 3x2 experiment (N = 577) with error rates of 10%, 30%, or 50% on easier or harder diagram generation tasks. We confirmed that observing more errors reduces use, but we unexpectedly found that easy-task errors did not significantly reduce use more than hard-task errors, suggesting that people are not averse to jaggedness in this experimental setting. We encourage future work that varies task difficulty at the same time as other…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.