Loading paper
CityBench: Evaluating the Capabilities of Large Language Models for Urban Tasks | Tomesphere