Loading paper
M3MAD-Bench: Are Multi-Agent Debates Really Effective Across Domains and Modalities? | Tomesphere