Loading paper
DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training | Tomesphere