BI-LAVA: Biocuration with Hierarchical Image Labeling through Active Learning and Visual Analysis
Juan Trelles, Andrew Wentzel, William Berrios, G. Elisabeta, Marai

TL;DR
BI-LAVA is a system that combines active learning and visual analysis to facilitate hierarchical image labeling in biomedical datasets, addressing challenges of incomplete labels and complex taxonomies.
Contribution
It introduces an iterative visual analytics and active learning strategy implemented in BI-LAVA, enabling effective hierarchical image labeling with minimal labeled data.
Findings
Supports biocurators in exploring and understanding image datasets
Improves label quality and completeness through human-machine collaboration
Successfully handles incomplete and hierarchical labels in biomedical images
Abstract
In the biomedical domain, taxonomies organize the acquisition modalities of scientific images in hierarchical structures. Such taxonomies leverage large sets of correct image labels and provide essential information about the importance of a scientific publication, which could then be used in biocuration tasks. However, the hierarchical nature of the labels, the overhead of processing images, the absence or incompleteness of labeled data, and the expertise required to label this type of data impede the creation of useful datasets for biocuration. From a multi-year collaboration with biocurators and text-mining researchers, we derive an iterative visual analytics and active learning strategy to address these challenges. We implement this strategy in a system called BI-LAVA Biocuration with Hierarchical Image Labeling through Active Learning and Visual Analysis. BI-LAVA leverages a small…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Digital Imaging for Blood Diseases · Biomedical Text Mining and Ontologies
MethodsVisual Analytics
