Loading paper
SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios | Tomesphere