CoralVQA: A Large-Scale Visual Question Answering Dataset for Coral Reef Image Understanding

Hongyong Han; Wei Wang; Gaowei Zhang; Mingjie Li; Yi Wang

arXiv:2507.10449·cs.CV·November 4, 2025

CoralVQA: A Large-Scale Visual Question Answering Dataset for Coral Reef Image Understanding

Hongyong Han, Wei Wang, Gaowei Zhang, Mingjie Li, Yi Wang

PDF

1 Datasets 1 Video

TL;DR

CoralVQA introduces a large-scale, domain-specific VQA dataset for coral reef images, enabling advanced ecological analysis and supporting conservation efforts through vision-language reasoning.

Contribution

This paper presents the first extensive coral reef VQA dataset, CoralVQA, with a semi-automatic construction process involving marine biologists, addressing domain-specific challenges.

Findings

01

State-of-the-art LVLMs face limitations on coralVQA

02

CoralVQA enables comprehensive ecological question answering

03

Benchmark results highlight future research directions

Abstract

Coral reefs are vital yet vulnerable ecosystems that require continuous monitoring to support conservation. While coral reef images provide essential information in coral monitoring, interpreting such images remains challenging due to the need for domain expertise. Visual Question Answering (VQA), powered by Large Vision-Language Models (LVLMs), has great potential in user-friendly interaction with coral reef images. However, applying VQA to coral imagery demands a dedicated dataset that addresses two key challenges: domain-specific annotations and multidimensional questions. In this work, we introduce CoralVQA, the first large-scale VQA dataset for coral reef analysis. It contains 12,805 real-world coral images from 67 coral genera collected from 3 oceans, along with 277,653 question-answer pairs that comprehensively assess ecological and health-related conditions. To construct this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

CoralReefData/CoralVQA
dataset· 139 dl
139 dl

Videos

CoralVQA: A Large-Scale Visual Question Answering Dataset for Coral Reef Image Understanding· slideslive