Masked prediction tasks: a parameter identifiability view
Bingbin Liu, Daniel Hsu, Pradeep Ravikumar, Andrej Risteski

TL;DR
This paper investigates the conditions under which masked prediction tasks in self-supervised learning can uniquely identify the underlying parameters of latent-variable models, providing a theoretical perspective on model identifiability.
Contribution
It introduces a parameter identifiability framework for self-supervised masked prediction tasks, especially for Hidden Markov Models, linking it to tensor rank decompositions.
Findings
Identifiability depends on the specific prediction task used.
Some masked prediction tasks can recover true model parameters.
Connections established between self-supervised learning and tensor decomposition theory.
Abstract
The vast majority of work in self-supervised learning, both theoretical and empirical (though mostly the latter), have largely focused on recovering good features for downstream tasks, with the definition of "good" often being intricately tied to the downstream task itself. This lens is undoubtedly very interesting, but suffers from the problem that there isn't a "canonical" set of downstream tasks to focus on -- in practice, this problem is usually resolved by competing on the benchmark dataset du jour. In this paper, we present an alternative lens: one of parameter identifiability. More precisely, we consider data coming from a parametric probabilistic model, and train a self-supervised learning predictor with a suitably chosen parametric form. Then, we ask whether we can read off the ground truth parameters of the probabilistic model from the optimal predictor. We focus on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Topic Modeling · Anomaly Detection Techniques and Applications
