MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome