When not to help: planning for lasting human-AI collaboration
Mark Steyvers, Lukas Mayer

TL;DR
This paper presents a POMDP-based framework for AI systems to decide when to assist users, balancing help effectiveness with long-term user engagement to prevent disengagement and optimize collaboration.
Contribution
It introduces a novel cognitive modeling approach that uses counterfactual reasoning within POMDPs to optimize AI assistance timing for sustained human-AI collaboration.
Findings
Adaptive assistance outperforms always-help or never-help policies.
Counterfactual reasoning improves long-term user engagement.
Balancing short-term accuracy with engagement enhances collaboration.
Abstract
AI systems and technologies that can interact with humans in real time face a communication dilemma: when to offer assistance and how frequently. Overly frequent or contextually redundant assistance can cause users to disengage, undermining the long-term benefits of AI assistance. We introduce a cognitive modeling framework based on Partially Observable Markov Decision Processes (POMDPs) that addresses this timing challenge by inferring a user's latent cognitive state related to AI engagement over time. Additionally, our framework incorporates reasoning about the long-term effects of AI assistance, explicitly aiming to avoid actions that could lead the human user to disengage or deactivate the AI. A key component of our approach is counterfactual reasoning: at each time step, the AI considers how well the user would perform independently and weighs the potential boost in performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman-Automation Interaction and Safety · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI
