Reprogramming Language Models for Molecular Representation Learning

Ria Vinod; Pin-Yu Chen; Payel Das

arXiv:2012.03460·cs.LG·January 7, 2021·1 cites

Reprogramming Language Models for Molecular Representation Learning

Ria Vinod, Pin-Yu Chen, Payel Das

PDF

Open Access

TL;DR

This paper introduces R2DL, a novel adversarial reprogramming algorithm that leverages dictionary learning to adapt pretrained language models for molecular tasks, outperforming existing models especially with limited data.

Contribution

The paper proposes R2DL, a new method for reprogramming language models to molecular tasks using dictionary learning, enabling effective transfer learning across domains.

Findings

01

R2DL matches state-of-the-art toxicity prediction models.

02

R2DL outperforms baselines with limited training data.

03

Demonstrates domain-agnostic transfer learning for molecular data.

Abstract

Recent advancements in transfer learning have made it a promising approach for domain adaptation via transfer of learned representations. This is especially when relevant when alternate tasks have limited samples of well-defined and labeled data, which is common in the molecule data domain. This makes transfer learning an ideal approach to solve molecular learning tasks. While Adversarial reprogramming has proven to be a successful method to repurpose neural networks for alternate tasks, most works consider source and alternate tasks within the same domain. In this work, we propose a new algorithm, Representation Reprogramming via Dictionary Learning (R2DL), for adversarially reprogramming pretrained language models for molecular learning tasks, motivated by leveraging learned representations in massive state of the art language models. The adversarial program learns a linear…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning in Materials Science · Domain Adaptation and Few-Shot Learning