StacMR: Scene-Text Aware Cross-Modal Retrieval