Hyperdimensional Cross-Modal Alignment of Frozen Language and Image Models for Efficient Image Captioning