A Highly Efficient Diversity-based Input Selection for DNN Improvement Using VLMs
Amin Abbasishahkoo, Mahboubeh Dadkhah, Lionel Briand

TL;DR
This paper introduces Concept-Based Diversity (CBD), an efficient image input selection metric leveraging Vision-Language Models, which improves DNN performance while reducing computational costs compared to existing methods.
Contribution
The paper proposes CBD, a novel, scalable diversity metric based on VLMs, and demonstrates its effectiveness in enhancing DNNs through hybrid selection with uncertainty measures.
Findings
CBD correlates strongly with Geometric Diversity but is computationally cheaper.
CBD-based selection outperforms state-of-the-art baselines in improving DNNs.
The approach remains efficient even on large datasets like ImageNet.
Abstract
Maintaining or improving the performance of Deep Neural Networks (DNNs) through fine-tuning requires labeling newly collected inputs, a process that is often costly and time-consuming. To alleviate this problem, input selection approaches have been developed in recent years to identify small, yet highly informative subsets for labeling. Diversity-based selection is one of the most effective approaches for this purpose. However, they are often computationally intensive and lack scalability for large input sets, limiting their practical applicability. To address this challenge, we introduce Concept-Based Diversity (CBD), a highly efficient metric for image inputs that leverages Vision-Language Models (VLM). Our results show that CBD exhibits a strong correlation with Geometric Diversity (GD), an established diversity metric, while requiring only a fraction of its computation time.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
