Loading paper
LLaVA-RE: Binary Image-Text Relevancy Evaluation with Multimodal Large Language Model | Tomesphere