Care-Conditioned Neuromodulation for Autonomy-Preserving Supportive Dialogue Agents
Shalima Binta Manir, Tim Oates

TL;DR
This paper introduces Care-Conditioned Neuromodulation (CCN), a control framework for dialogue agents that balances helpfulness with user autonomy by modeling relational risks and optimizing response generation.
Contribution
It formalizes an autonomy-preserving alignment problem and demonstrates that CCN improves autonomy support while maintaining helpfulness in dialogue models.
Findings
CCN improves autonomy-preserving utility by +0.25 over supervised fine-tuning.
CCN improves utility by +0.07 over preference optimization baselines.
Human evaluation shows directional agreement with automated metrics.
Abstract
Large language models deployed in supportive or advisory roles must balance helpfulness with preservation of user autonomy, yet standard alignment methods primarily optimize for helpfulness and harmlessness without explicitly modeling relational risks such as dependency reinforcement, overprotection, or coercive guidance. We introduce Care-Conditioned Neuromodulation (CCN), a state-dependent control framework in which a learned scalar signal derived from structured user state and dialogue context conditions response generation and candidate selection. We formalize this setting as an autonomy-preserving alignment problem and define a utility function that rewards autonomy support and helpfulness while penalizing dependency and coercion. We also construct a benchmark of relational failure modes in multi-turn dialogue, including reassurance dependence, manipulative care, overprotection,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
