Loading paper
Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models | Tomesphere