A Highly Efficient Diversity-based Input Selection for DNN Improvement Using VLMs

Amin Abbasishahkoo; Mahboubeh Dadkhah; Lionel Briand

arXiv:2601.08024·cs.CV·January 14, 2026

A Highly Efficient Diversity-based Input Selection for DNN Improvement Using VLMs

Amin Abbasishahkoo, Mahboubeh Dadkhah, Lionel Briand

PDF

Open Access

TL;DR

This paper introduces Concept-Based Diversity (CBD), an efficient image input selection metric leveraging Vision-Language Models, which improves DNN performance while reducing computational costs compared to existing methods.

Contribution

The paper proposes CBD, a novel, scalable diversity metric based on VLMs, and demonstrates its effectiveness in enhancing DNNs through hybrid selection with uncertainty measures.

Findings

01

CBD correlates strongly with Geometric Diversity but is computationally cheaper.

02

CBD-based selection outperforms state-of-the-art baselines in improving DNNs.

03

The approach remains efficient even on large datasets like ImageNet.

Abstract

Maintaining or improving the performance of Deep Neural Networks (DNNs) through fine-tuning requires labeling newly collected inputs, a process that is often costly and time-consuming. To alleviate this problem, input selection approaches have been developed in recent years to identify small, yet highly informative subsets for labeling. Diversity-based selection is one of the most effective approaches for this purpose. However, they are often computationally intensive and lack scalability for large input sets, limiting their practical applicability. To address this challenge, we introduce Concept-Based Diversity (CBD), a highly efficient metric for image inputs that leverages Vision-Language Models (VLM). Our results show that CBD exhibits a strong correlation with Geometric Diversity (GD), an established diversity metric, while requiring only a fraction of its computation time.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning