Loading paper
XM-ALIGN: Unified Cross-Modal Embedding Alignment for Face-Voice Association | Tomesphere