Neural Network Surgery with Sets
Jonathan Raiman, Susan Zhang, Christy Dennison

TL;DR
This paper introduces a set-based approach for neural network surgery, enabling automatic identification of which parts of a model to salvage or retrain after architecture or feature set modifications, demonstrated on OpenAI Five.
Contribution
The paper proposes a novel set-based methodology for neural network surgery that automates the process of determining salvageable components after architecture changes.
Findings
Successfully transferred weights across feature and architecture modifications in OpenAI Five.
Derived two methods for detecting feature-parameter interaction maps and proved their equivalence.
Validated the approach empirically on the OpenAI Five model.
Abstract
The cost to train machine learning models has been increasing exponentially, making exploration and research into the correct features and architecture a costly or intractable endeavor at scale. However, using a technique named "surgery" OpenAI Five was continuously trained to play the game DotA 2 over the course of 10 months through 20 major changes in features and architecture. Surgery transfers trained weights from one network to another after a selection process to determine which sections of the model are unchanged and which must be re-initialized. In the past, the selection process relied on heuristics, manual labor, or pre-existing boundaries in the structure of the model, limiting the ability to salvage experiments after modifications of the feature set or input reorderings. We propose a solution to automatically determine which components of a neural network model should be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
