Loading paper
DeepResearch Bench II: Diagnosing Deep Research Agents via Rubrics from Expert Report | Tomesphere