Loading paper
Position: Theory of Mind Benchmarks are Broken for Large Language Models | Tomesphere