Loading paper
SEPS: Semantic-enhanced Patch Slimming Framework for fine-grained cross-modal alignment | Tomesphere