Label-efficient underwater species classification with logistic regression on frozen foundation model embeddings

Thomas Manuel Rost

arXiv:2604.00313·cs.CV·April 17, 2026

Label-efficient underwater species classification with logistic regression on frozen foundation model embeddings

Thomas Manuel Rost

PDF

TL;DR

This paper demonstrates that a simple logistic regression classifier on frozen foundation model embeddings can effectively classify underwater species with minimal supervision, matching or surpassing more complex models.

Contribution

It shows that frozen foundation model embeddings enable label-efficient underwater species classification without additional training or domain-specific engineering.

Findings

01

Logistic regression on frozen embeddings achieves 88.5% macro F1 at full supervision.

02

With only 21 labeled examples per class, macro F1 exceeds 80%.

03

The approach requires no fine-tuning or domain-specific data engineering.

Abstract

Automated species classification from underwater imagery is bottlenecked by the cost of expert annotation, and supervised models trained on one dataset rarely transfer to new conditions. We investigate whether a simple classifier operating on frozen foundation model embeddings can close this gap. Using frozen DINOv3 ViT-B/16 embeddings with no fine-tuning, we train a logistic regression classifier and evaluate on the AQUA20 benchmark (20 marine species). At full supervision, logistic regression achieves 88.5% macro F1 compared to ConvNeXt's 88.9%, a gap of 0.4 percentage points, while outperforming the supervised baseline on 8 of 20 species. Under label scarcity, with 21 labeled examples per class (approximately 6% of training labels), macro F1 exceeds 80%. The near-parity with end-to-end supervised learning demonstrates that these general-purpose, frozen representations exhibit strong…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.