Loading paper
SoftHateBench: Evaluating Moderation Models Against Reasoning-Driven, Policy-Compliant Hostility | Tomesphere