SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors