Loading paper
ResearchArena: Benchmarking Large Language Models' Ability to Collect and Organize Information as Research Agents | Tomesphere