Automatic Image-Level Morphological Trait Annotation for Organismal Images
Vardaan Pahuja, Samuel Stevens, Alyson East, Sydne Record, Yu Su

TL;DR
This paper presents a scalable, automated method for annotating morphological traits in organism images, creating a large dataset and enabling ecological studies with minimal manual effort.
Contribution
It introduces a novel trait annotation pipeline using autoencoders and foundation-model features, resulting in the Bioscan-Traits dataset of 80K annotations.
Findings
Human evaluation confirms biological plausibility of descriptions
Systematic ablation study shows impact of design choices on quality
Constructed a large, diverse trait dataset for ecological analysis
Abstract
Morphological traits are physical characteristics of biological organisms that provide vital clues on how organisms interact with their environment. Yet extracting these traits remains a slow, expert-driven process, limiting their use in large-scale ecological studies. A major bottleneck is the absence of high-quality datasets linking biological images to trait-level annotations. In this work, we demonstrate that sparse autoencoders trained on foundation-model features yield monosemantic, spatially grounded neurons that consistently activate on meaningful morphological parts. Leveraging this property, we introduce a trait annotation pipeline that localizes salient regions and uses vision-language prompting to generate interpretable trait descriptions. Using this approach, we construct Bioscan-Traits, a dataset of 80K trait annotations spanning 19K insect images from BIOSCAN-5M. Human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
