Food-500 Cap: A Fine-Grained Food Caption Benchmark for Evaluating Vision-Language Models