Loading paper
CIBench: Evaluating Your LLMs with a Code Interpreter Plugin | Tomesphere