Loading paper
FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models | Tomesphere