Loading paper
SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks | Tomesphere