ModelPred: A Framework for Predicting Trained Model from Training Data

Yingyan Zeng; Jiachen T. Wang; Si Chen; Hoang Anh Just; Ran Jin; Ruoxi Jia

arXiv:2111.12545·cs.LG·December 27, 2022

TL;DR

ModelPred is a neural network-based framework that predicts trained model parameters directly from training data, with applications to interpretability, data valuation, and model calibration in machine learning workflows.

Contribution

It introduces a neural set function approach, combined with regularization techniques, that predicts model parameters directly from training data. This differs from prior work such as Datamodels, which predicts the behaviors of a trained model rather than its parameters.

Findings

01 Effective in predicting model parameters across various datasets

02 Improves interpretability and accountability in ML workflows

03 Enables applications like data valuation and model calibration

Abstract

In this work, we propose ModelPred, a framework that helps to understand the impact of changes in training data on a trained model. This is critical for building trust at various stages of a machine learning pipeline: from cleaning poor-quality samples and tracking important ones to be collected during data preparation, to calibrating the uncertainty of model predictions, to interpreting why certain behaviors of a model emerge during deployment. Specifically, ModelPred learns a parameterized function that takes a dataset $S$ as input and predicts the model obtained by training on $S$. Our work differs from the recent work on Datamodels [1] in that we aim to predict the trained model parameters directly rather than the trained model behaviors. We demonstrate that a neural network-based set function class is capable of learning the complex relationships between the training data and model…
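To make the idea of a function that maps a dataset $S$ to model parameters more concrete, here is a minimal sketch of a permutation-invariant set function in plain Python. It is not the authors' implementation: the class name `SetFunctionSketch`, the embed-then-pool layout (a per-sample map `phi`, mean pooling, and an output map `rho`), the layer sizes, and the random untrained weights are all assumptions chosen for illustration. The abstract only states that a neural network-based set function class is used.

```python
import math
import random


def make_layer(n_in, n_out, rng):
    """Random dense layer (weights, biases); untrained, for illustration only."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(layer, x, activation=None):
    """Apply an affine map to vector x, followed by an optional activation."""
    weights, biases = layer
    out = [sum(w * v for w, v in zip(row, x)) + b
           for row, b in zip(weights, biases)]
    return [activation(o) for o in out] if activation else out


class SetFunctionSketch:
    """Hypothetical map from a dataset S to a predicted parameter vector."""

    def __init__(self, feature_dim, hidden_dim, num_params, seed=0):
        rng = random.Random(seed)
        # phi embeds a single (x, y) pair; rho maps the pooled embedding
        # to the predicted parameters. Both are assumptions of this sketch.
        self.phi = make_layer(feature_dim + 1, hidden_dim, rng)
        self.rho = make_layer(hidden_dim, num_params, rng)

    def predict_params(self, dataset):
        """dataset is a list of (features, label) pairs."""
        embeddings = [dense(self.phi, list(x) + [float(y)], math.tanh)
                      for x, y in dataset]
        # Mean pooling makes the output independent of sample order.
        pooled = [math.fsum(col) / len(embeddings) for col in zip(*embeddings)]
        return dense(self.rho, pooled)


# Toy dataset S with two features per sample and a binary label.
S = [((0.1, 0.2), 0), ((0.5, -0.3), 1), ((-0.7, 0.9), 1)]
model = SetFunctionSketch(feature_dim=2, hidden_dim=8, num_params=3)

theta_full = model.predict_params(S)
# Predicted parameters after removing the first sample, without retraining.
theta_without_first = model.predict_params(S[1:])
```

In a real use of this idea, the weights of `phi` and `rho` would be fitted on pairs of (training subset, parameters obtained by training on that subset). Comparing predictions such as `theta_full` and `theta_without_first` is one way a predictor of this kind could estimate the effect of removing a sample, which is the sort of question that arises in data valuation.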

Code & Models

Repositories

yyzeng43/ModelPred (PyTorch, official)

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification