Loading paper
Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning | Tomesphere