Loading paper
Assessing the Value of Visual Input: A Benchmark of Multimodal Large Language Models for Robotic Path Planning | Tomesphere