SynBench: Task-Agnostic Benchmarking of Pretrained Representations using   Synthetic Data

Ching-Yun Ko; Pin-Yu Chen; Jeet Mohapatra; Payel Das; Luca Daniel

arXiv:2210.02989·cs.LG·October 10, 2022·1 cites

SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data

Ching-Yun Ko, Pin-Yu Chen, Jeet Mohapatra, Payel Das, Luca Daniel

PDF

Open Access

TL;DR

SynBench is a task-agnostic benchmarking framework that evaluates the quality of pretrained representations using synthetic data based on a theoretical robustness-accuracy tradeoff, applicable across various models and tasks.

Contribution

It introduces a novel, task-agnostic benchmarking method for pretrained models using synthetic data and a theoretical robustness-accuracy tradeoff as a reference.

Findings

01

SynBench scores correlate well with actual downstream performance.

02

Framework can guide robust linear probing to improve model robustness.

03

Applicable to diverse pretrained models and data types.

Abstract

Recent success in fine-tuning large models, that are pretrained on broad data at scale, on downstream tasks has led to a significant paradigm shift in deep learning, from task-centric model design to task-agnostic representation learning and task-specific fine-tuning. As the representations of pretrained models are used as a foundation for different downstream tasks, this paper proposes a new task-agnostic framework, \textit{SynBench}, to measure the quality of pretrained representations using synthetic data. We set up a reference by a theoretically-derived robustness-accuracy tradeoff of the class conditional Gaussian mixture. Given a pretrained model, the representations of data synthesized from the Gaussian mixture are used to compare with our reference to infer the quality. By comparing the ratio of area-under-curve between the raw data and their representations, SynBench offers a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning

MethodsAttention Is All You Need · Linear Layer · Softmax · Dense Connections · Layer Normalization · Residual Connection · Multi-Head Attention · Vision Transformer