Loading paper
When Safety Fails Before the Answer: Benchmarking Harmful Behavior Detection in Reasoning Chains | Tomesphere