Linear Alignment of Vision-language Models for Image Captioning