Hypothesis Testing for Class-Conditional Noise Using Local Maximum Likelihood
Weisong Yang, Rafael Poyiadzi, Niall Twomey, Raul Santos Rodriguez

TL;DR
This paper introduces a nonparametric hypothesis testing method for class-conditional label noise using local maximum likelihood, offering more flexible models than traditional parametric approaches.
Contribution
It extends existing hypothesis testing techniques to nonparametric logistic regression, reducing model misspecification issues and broadening applicability.
Findings
The nonparametric approach performs well on synthetic data.
It demonstrates advantages over parametric methods in real-world case studies.
The method is less susceptible to model misspecification.
Abstract
In supervised learning, automatically assessing the quality of the labels before any learning takes place remains an open research question. In certain particular cases, hypothesis testing procedures have been proposed to assess whether a given instance-label dataset is contaminated with class-conditional label noise, as opposed to uniform label noise. The existing theory builds on the asymptotic properties of the Maximum Likelihood Estimate for parametric logistic regression. However, the parametric assumptions on top of which these approaches are constructed are often too strong and unrealistic in practice. To alleviate this problem, in this paper we propose an alternative path by showing how similar procedures can be followed when the underlying model is a product of Local Maximum Likelihood Estimation that leads to more flexible nonparametric logistic regression models, which in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Statistical Methods and Models · Advanced Statistical Process Monitoring
MethodsLogistic Regression
