A2RBench: An Automatic Paradigm for Formally Verifiable Abstract Reasoning Benchmark Generation