Unlabeled Data Help: Minimax Analysis and Adversarial Robustness

Yue Xing; Qifan Song; Guang Cheng

arXiv:2202.06996·stat.ML·February 16, 2022·1 cites

Unlabeled Data Help: Minimax Analysis and Adversarial Robustness

Yue Xing, Qifan Song, Guang Cheng

PDF

Open Access

TL;DR

This paper provides a minimax analysis of reconstruction-based self-supervised learning, demonstrating its rate-optimality under various models and showing that unlabeled data can enhance adversarial robustness.

Contribution

It offers the first rigorous minimax analysis for reconstruction-based SSL and integrates it with adversarial training to improve robustness.

Findings

01

Reconstruction-based SSL is rate-optimal under certain models.

02

Unlabeled data improves adversarial robustness.

03

Theoretical validation of SSL's effectiveness with unlabeled data.

Abstract

The recent proposed self-supervised learning (SSL) approaches successfully demonstrate the great potential of supplementing learning algorithms with additional unlabeled data. However, it is still unclear whether the existing SSL algorithms can fully utilize the information of both labelled and unlabeled data. This paper gives an affirmative answer for the reconstruction-based SSL algorithm \citep{lee2020predicting} under several statistical models. While existing literature only focuses on establishing the upper bound of the convergence rate, we provide a rigorous minimax analysis, and successfully justify the rate-optimality of the reconstruction-based SSL algorithm under different data generation models. Furthermore, we incorporate the reconstruction-based SSL into the existing adversarial training algorithms and show that learning from unlabeled data helps improve the robustness.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Statistical Methods and Models · Domain Adaptation and Few-Shot Learning