ModelDiff: A Framework for Comparing Learning Algorithms

Harshay Shah; Sung Min Park; Andrew Ilyas; Aleksander Madry

arXiv:2211.12491 · cs.LG · November 23, 2022 · 5 cites

TL;DR

ModelDiff compares two learning algorithms by finding distinguishing feature transformations: input transformations that change the predictions of models trained with one algorithm but not the other.

Contribution

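The paper formalizes algorithm comparison as the search for distinguishing feature transformations and introduces ModelDiff, a method that leverages the datamodels framework (Ilyas et al., 2022) to compare learning algorithms based on how they use their training data.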

Findings

01

Case study: models trained with vs. without data augmentation

02

Case study: models trained with vs. without pre-training

03

Case study: models trained with different SGD hyperparameters

Abstract

We study the problem of (learning) algorithm comparison, where the goal is to find differences between models trained with two different learning algorithms. We begin by formalizing this goal as one of finding distinguishing feature transformations, i.e., input transformations that change the predictions of models trained with one learning algorithm but not the other. We then present ModelDiff, a method that leverages the datamodels framework (Ilyas et al., 2022) to compare learning algorithms based on how they use their training data. We demonstrate ModelDiff through three case studies, comparing models trained with/without data augmentation, with/without pre-training, and with different SGD hyperparameters. Our code is available at https://github.com/MadryLab/modeldiff.
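The definition of a distinguishing feature transformation in the abstract can be illustrated with a small toy sketch. This is not the ModelDiff method and does not use the datamodels framework or the code in the repository; it only checks, for a transformation that is already given, whether it shifts the predictions of one group of models more than another. All names here (`mean_prediction_shift`, `distinguishing_gap`, `make_linear_model`, `shift_feature_one`) and the linear toy models are hypothetical and chosen for illustration.

```python
import numpy as np


def mean_prediction_shift(models, transform, inputs):
    """Average absolute change in model output when `transform` is applied."""
    shifts = [np.abs(m(transform(inputs)) - m(inputs)).mean() for m in models]
    return float(np.mean(shifts))


def distinguishing_gap(models_a, models_b, transform, inputs):
    """Positive when `transform` moves algorithm A's predictions more than B's."""
    return (mean_prediction_shift(models_a, transform, inputs)
            - mean_prediction_shift(models_b, transform, inputs))


def make_linear_model(weights):
    """Hypothetical stand-in for a trained model: a fixed linear scorer."""
    w = np.asarray(weights, dtype=float)
    return lambda x: x @ w


def shift_feature_one(x):
    """Toy feature transformation: add 1.0 to the second input feature."""
    out = x.copy()
    out[:, 1] += 1.0
    return out


# Assumption: models from "algorithm A" rely on feature 1, models from "algorithm B" ignore it.
models_a = [make_linear_model([1.0, 0.5]), make_linear_model([0.9, 0.7])]
models_b = [make_linear_model([1.0, 0.0]), make_linear_model([1.1, 0.0])]

inputs = np.random.default_rng(0).normal(size=(100, 2))
gap = distinguishing_gap(models_a, models_b, shift_feature_one, inputs)
print(gap)  # approximately 0.6: A's outputs move by 0.5 and 0.7, B's do not move
```

In this toy setting the transformation is distinguishing because it changes the outputs of the first group of models while leaving the second group unchanged. Finding such transformations, rather than verifying a given one, is the problem the paper addresses.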

Code & Models

Repositories

madrylab/modeldiff
PyTorch · Official

Videos

ModelDiff: A Framework for Comparing Learning Algorithms · SlidesLive

Taxonomy

Topics: Machine Learning and Data Classification · Time Series Analysis and Forecasting · Multidisciplinary Science and Engineering Research

Methods: Stochastic Gradient Descent