Loading paper
Which One Are You Referring To? Multimodal Object Identification in Situated Dialogue | Tomesphere