Selfish Evolution: Making Discoveries in Extreme Label Noise with the   Help of Overfitting Dynamics

Nima Sedaghat; Tanawan Chatchadanoraset; Colin Orion Chandler; Ashish; Mahabal; Maryam Eslami

arXiv:2412.00077·cs.CV·December 3, 2024

Selfish Evolution: Making Discoveries in Extreme Label Noise with the Help of Overfitting Dynamics

Nima Sedaghat, Tanawan Chatchadanoraset, Colin Orion Chandler, Ashish, Mahabal, Maryam Eslami

PDF

Open Access

TL;DR

This paper introduces Selfish Evolution, a novel method that leverages overfitting dynamics during training to detect and correct label noise in weakly supervised datasets, demonstrated on astrophysical and standard datasets.

Contribution

The paper presents a new technique that uses model overfitting patterns to identify and correct corrupted labels without prior assumptions, applicable in weak supervision.

Findings

01

Effective label correction in astrophysical data

02

Demonstrated success on MNIST dataset

03

Automatic convergence to cleaner datasets

Abstract

Motivated by the scarcity of proper labels in an astrophysical application, we have developed a novel technique, called Selfish Evolution, which allows for the detection and correction of corrupted labels in a weakly supervised fashion. Unlike methods based on early stopping, we let the model train on the noisy dataset. Only then do we intervene and allow the model to overfit to individual samples. The ``evolution'' of the model during this process reveals patterns with enough information about the noisiness of the label, as well as its correct version. We train a secondary network on these spatiotemporal ``evolution cubes'' to correct potentially corrupted labels. We incorporate the technique in a closed-loop fashion, allowing for automatic convergence towards a mostly clean dataset, without presumptions about the state of the network in which we intervene. We evaluate on the main task…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic Technology and Sound Studies