Loading paper
Cross-Modal Fine-Tuning: Align then Refine | Tomesphere