Loading paper
MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents | Tomesphere