Loading paper
How Far Are LLMs from Believable AI? A Benchmark for Evaluating the Believability of Human Behavior Simulation | Tomesphere