HMR3D: Hierarchical Multimodal Representation for 3D Scene Understanding with Large Vision-Language Model