Scene-R1: Video-Grounded Large Language Models for 3D Scene Reasoning without 3D Annotations