On Model Stability as a Function of Random Seed

Pranava Madhyastha; Rishabh Jain

arXiv:1909.10447·cs.LG·September 24, 2019

On Model Stability as a Function of Random Seed

Pranava Madhyastha, Rishabh Jain

PDF

1 Repo

TL;DR

This paper investigates how random seed choices affect model stability and interpretability, and introduces techniques to enhance robustness and consistency across different random initializations.

Contribution

It presents ASWA and NASWA methods that significantly improve model stability against random seed variations, reducing performance variability.

Findings

01

Random seeds can cause significant variability in model interpretations.

02

ASWA and NASWA techniques improve model robustness and reduce performance variance.

03

On average, these methods reduce the standard deviation of model performance by 72%.

Abstract

In this paper, we focus on quantifying model stability as a function of random seed by investigating the effects of the induced randomness on model performance and the robustness of the model in general. We specifically perform a controlled study on the effect of random seeds on the behaviour of attention, gradient-based and surrogate model based (LIME) interpretations. Our analysis suggests that random seeds can adversely affect the consistency of models resulting in counterfactual interpretations. We propose a technique called Aggressive Stochastic Weight Averaging (ASWA)and an extension called Norm-filtered Aggressive Stochastic Weight Averaging (NASWA) which improves the stability of models over random seeds. With our ASWA and NASWA based optimization, we are able to improve the robustness of the original model, on average reducing the standard deviation of the model's performance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rishj97/ModelStability
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsStochastic Weight Averaging