Loading paper
BioXArena: Benchmarking LLM Agents on Multi-Modal Biomedical Machine Learning Tasks | Tomesphere