Loading paper
SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment | Tomesphere