Benchmarking Large Language Models for Image Classification of Marine Mammals
Yijiashun Qi, Shuzhang Cai, Zunduo Zhao, Jiaming Li, Yanbin Lin,, Zhiqiang Wang

TL;DR
This paper introduces a new benchmark dataset of 1,423 images of 65 marine mammal species, evaluates various AI models including LLMs and a novel multi-agent system for classification, and demonstrates their strengths and improvements.
Contribution
The work creates the first marine mammal-specific dataset and benchmarks multiple AI approaches, including a novel LLM-based multi-agent system for classification.
Findings
Traditional models perform well in certain aspects
LLMs show strong zero-shot classification abilities
The multi-agent system enhances classification accuracy
Abstract
As Artificial Intelligence (AI) has developed rapidly over the past few decades, the new generation of AI, Large Language Models (LLMs) trained on massive datasets, has achieved ground-breaking performance in many applications. Further progress has been made in multimodal LLMs, with many datasets created to evaluate LLMs with vision abilities. However, none of those datasets focuses solely on marine mammals, which are indispensable for ecological equilibrium. In this work, we build a benchmark dataset with 1,423 images of 65 kinds of marine mammals, where each animal is uniquely classified into different levels of class, ranging from species-level to medium-level to group-level. Moreover, we evaluate several approaches for classifying these marine mammals: (1) machine learning (ML) algorithms using embeddings provided by neural networks, (2) influential pre-trained neural networks, (3)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Image Retrieval and Classification Techniques · Machine Learning in Bioinformatics
MethodsContrastive Language-Image Pre-training · Mixing Adam and SGD
