Curriculum Learning with Diversity for Supervised Computer Vision Tasks
Petru Soviany

TL;DR
This paper proposes a novel curriculum sampling strategy that combines data diversity and difficulty estimation based on human visual search time, improving training efficiency and accuracy in computer vision tasks.
Contribution
Introduces a new curriculum sampling method considering data diversity and human-based difficulty metrics, outperforming standard curriculum and random sampling in object detection and segmentation.
Findings
Outperforms baseline and standard curriculum methods on Pascal VOC 2007 and Cityscapes.
Achieves faster convergence and higher accuracy on unbalanced datasets.
Effective in improving training efficiency for unbalanced data.
Abstract
Curriculum learning techniques are a viable solution for improving the accuracy of automatic models, by replacing the traditional random training with an easy-to-hard strategy. However, the standard curriculum methodology does not automatically provide improved results, but it is constrained by multiple elements like the data distribution or the proposed model. In this paper, we introduce a novel curriculum sampling strategy which takes into consideration the diversity of the training data together with the difficulty of the inputs. We determine the difficulty using a state-of-the-art estimator based on the human time required for solving a visual search task. We consider this kind of difficulty metric to be better suited for solving general problems, as it is not based on certain task-dependent elements, but more on the context of each image. We ensure the diversity during training,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning
