V$^2$L: Leveraging Vision and Vision-language Models into Large-scale Product Retrieval