Loading paper
When Models Outthink Their Safety: Unveiling and Mitigating Self-Jailbreak in Large Reasoning Models | Tomesphere