SLIP & ETHICS: Graduated Intervention for AI Emotional Companions
Minseo Kim

TL;DR
This paper introduces SLIP and ETHICS, a graduated intervention framework for AI emotional companions that balances safety and rapport through staged responses based on affect and narrative signals.
Contribution
It proposes a novel four-stage intervention protocol and a taxonomy for interpreting interaction signals, with initial evaluation demonstrating improved detection and safety boundaries.
Findings
Zero false positives for flow persona in deployment
Escalation patterns observed in crisis-oriented personas
Model capability enhances detection accuracy from 0/8 to 6/8
Abstract
AI emotional companions face a safety-rapport paradox: restrictive safeguards can damage supportive alliance, while permissive systems risk user harm. We present SLIP (Staged Layers of Intervention Protocol), a four-stage graduated methodology deriving interventions (none, soft, hard) from structured qualitative indicators -- affect intensity (a) and narrative dynamism (m) -- alongside ETHICS (Emergent Taxonomy for Human-AI Interaction Context Signals), a "signals not labels" taxonomy. An evaluation combining a small-scale production deployment (N=68 entries, 10 users, 10 weeks) with a synthetic persona battery (N=91, 5 behavioral-risk profiles) achieved 0% false positives for the flow persona and showed expected escalation patterns in crisis-oriented personas. However, initial results showed that 8 consecutive days of high-energy elevation produced zero interventions (0/8), exposing a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
