Loading paper
LITTA: Late-Interaction and Test-Time Alignment for Visually-Grounded Multimodal Retrieval | Tomesphere