Hierarchy-Guided Multimodal Representation Learning for Taxonomic Inference
Sk Miraj Ahmed, Xi Yu, Yunqi Li, Yuewei Lin, Wei Xu

TL;DR
This paper introduces hierarchy-aware multimodal learning methods for taxonomic inference, significantly improving biodiversity classification accuracy and robustness by encoding biological hierarchy and supporting flexible modality fusion.
Contribution
It presents two novel end-to-end models that incorporate hierarchical information and modality fusion, enhancing robustness and accuracy in biodiversity identification tasks.
Findings
Over 14% accuracy improvement over baselines
Significant gains under partial and corrupted DNA data
Enhanced robustness through hierarchy-aware embedding
Abstract
Accurate biodiversity identification from large-scale field data is a foundational problem with direct impact on ecology, conservation, and environmental monitoring. In practice, the core task is taxonomic prediction - inferring order, family, genus, or species from imperfect inputs such as specimen images, DNA barcodes, or both. Existing multimodal methods often treat taxonomy as a flat label space and therefore fail to encode the hierarchical structure of biological classification, which is critical for robustness under noise and missing modalities. We present two end-to-end variants for hierarchy-aware multimodal learning: CLiBD-HiR, which introduces Hierarchical Information Regularization (HiR) to shape embedding geometry across taxonomic levels, yielding structured and noise-robust representations; and CLiBD-HiR-Fuse, which additionally trains a lightweight fusion predictor that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpecies Distribution and Climate Change · Environmental DNA in Biodiversity Studies · Advanced Neural Network Applications
