Self-Guided Masked Autoencoders for Domain-Agnostic Self-Supervised   Learning

Johnathan Xie; Yoonho Lee; Annie S. Chen; Chelsea Finn

arXiv:2402.14789·cs.LG·February 23, 2024·1 cites

Self-Guided Masked Autoencoders for Domain-Agnostic Self-Supervised Learning

Johnathan Xie, Yoonho Lee, Annie S. Chen, Chelsea Finn

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Self-guided Masked Autoencoders (SMA), a domain-agnostic self-supervised learning method that learns masks without domain-specific assumptions, achieving state-of-the-art results across diverse scientific data modalities.

Contribution

SMA is the first fully domain-agnostic masked modeling approach that learns masks via attention, removing the need for domain-specific augmentations or assumptions.

Findings

01

Achieves state-of-the-art performance on protein biology benchmarks

02

Effective in chemical property prediction tasks

03

Performs well in particle physics data analysis

Abstract

Self-supervised learning excels in learning representations from large amounts of unlabeled data, demonstrating success across multiple data modalities. Yet, extending self-supervised learning to new modalities is non-trivial because the specifics of existing methods are tailored to each domain, such as domain-specific augmentations which reflect the invariances in the target task. While masked modeling is promising as a domain-agnostic framework for self-supervised learning because it does not rely on input augmentations, its mask sampling procedure remains domain-specific. We present Self-guided Masked Autoencoders (SMA), a fully domain-agnostic masked modeling method. SMA trains an attention based model using a masked modeling objective, by learning masks to sample without any domain-specific assumptions. We evaluate SMA on three self-supervised learning benchmarks in protein…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

johnathan-xie/sma
jaxOfficial

Videos

Self-Guided Masked Autoencoders for Domain-Agnostic Self-Supervised Learning· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning

MethodsSlime Mould Algorithm