GraphEval36K: Benchmarking Coding and Reasoning Capabilities of Large Language Models on Graph Datasets