Data Distribution-based Curriculum Learning
Shonal Chaudhry, Anuraganand Sharma

TL;DR
This paper introduces Data Distribution-based Curriculum Learning (DDCL), a novel method that orders training samples based on data distribution to improve classifier performance and convergence speed.
Contribution
The paper proposes DDCL, a new curriculum learning approach utilizing data distribution for sample ordering, with two scoring methods, enhancing classification accuracy and training efficiency.
Findings
DDCL improves average classification accuracy across multiple datasets.
DDCL accelerates convergence during training.
Both scoring methods outperform no-curriculum baseline.
Abstract
The order of training samples can have a significant impact on the performance of a classifier. Curriculum learning is a method of ordering training samples from easy to hard. This paper proposes the novel idea of a curriculum learning approach called Data Distribution-based Curriculum Learning (DDCL). DDCL uses the data distribution of a dataset to build a curriculum based on the order of samples. Two types of scoring methods known as DDCL (Density) and DDCL (Point) are used to score training samples thus determining their training order. DDCL (Density) uses the sample density to assign scores while DDCL (Point) utilises the Euclidean distance for scoring. We evaluate the proposed DDCL approach by conducting experiments on multiple datasets using a neural network, support vector machine and random forest classifier. Evaluation results show that the application of DDCL improves the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistics Education and Methodologies
