Semi-supervised learning using copula-based regression and model   averaging

Ziwen Gao; Huihang Liu; and Xinyu Zhang

arXiv:2411.07617·stat.ME·November 13, 2024

Semi-supervised learning using copula-based regression and model averaging

Ziwen Gao, Huihang Liu, and Xinyu Zhang

PDF

Open Access

TL;DR

This paper introduces a semi-supervised learning approach that models the regression function with copulas and marginal distributions, leveraging unlabeled data to improve estimation and prediction accuracy through model averaging.

Contribution

It proposes a novel copula-based regression framework with model averaging that effectively utilizes unlabeled data for improved semi-supervised learning.

Findings

01

Achieves faster convergence rates than supervised methods.

02

Demonstrates effectiveness on simulations and real data.

03

Provides theoretical guarantees for estimators.

Abstract

The available data in semi-supervised learning usually consists of relatively small sized labeled data and much larger sized unlabeled data. How to effectively exploit unlabeled data is the key issue. In this paper, we write the regression function in the form of a copula and marginal distributions, and the unlabeled data can be exploited to improve the estimation of the marginal distributions. The predictions based on different copulas are weighted, where the weights are obtained by minimizing an asymptotic unbiased estimator of the prediction risk. Error-ambiguity decomposition of the prediction risk is performed such that unlabeled data can be exploited to improve the prediction risk estimation. We demonstrate the asymptotic normality of copula parameters and regression function estimators of the candidate models under the semi-supervised framework, as well as the asymptotic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems