TReB: A Comprehensive Benchmark for Evaluating Table Reasoning Capabilities of Large Language Models