Loading paper
MINOS: A Multimodal Evaluation Model for Bidirectional Generation Between Image and Text | Tomesphere