Loading paper
Consistent but Dangerous: Per-Sample Safety Classification Reveals False Reliability in Medical Vision-Language Models | Tomesphere