Agent psychometrics: Task-level performance prediction in agentic coding benchmarks