Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining
Qi Zhang, Tianqi Du, Haotian Huang, Yifei Wang, Yisen Wang

TL;DR
This paper provides a theoretical comparison of autoregressive and masked self-supervised learning paradigms, revealing their respective strengths and limitations in classification and content generation tasks, and proposing hybrid objectives to enhance performance.
Contribution
It introduces the first theoretical framework comparing autoregressive and masked SSL, and proposes hybrid objectives to leverage their complementary strengths.
Findings
Masked SSL fosters better inter-sample connections in classification.
Autoregressive SSL achieves superior clustering performance.
Hybrid objectives improve both classification and generation performance.
Abstract
In recent years, the rise of generative self-supervised learning (SSL) paradigms has exhibited impressive performance across visual, language, and multi-modal domains. While the varied designs of generative SSL objectives lead to distinct properties in downstream tasks, a theoretical understanding of these differences remains largely unexplored. In this paper, we establish the first theoretical comparisons between two leading generative SSL paradigms: autoregressive SSL and masked SSL. Through establishing theoretical frameworks, we elucidate the strengths and limitations of autoregressive and masked SSL within the primary evaluation tasks of classification and content generation. Our findings demonstrate that in classification tasks, the flexibility of targeted tokens in masked SSL fosters more inter-sample connections compared to the fixed position of target tokens in autoregressive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTeacher Education and Leadership Studies · Educational Environments and Student Outcomes
