Monocular 3D Object Position Estimation with VLMs for Human-Robot Interaction