Loading paper
ART: Action-based Reasoning Task Benchmarking for Medical AI Agents | Tomesphere