Model Repair: Robust Recovery of Over-Parameterized Statistical Models
Chao Gao, John Lafferty

TL;DR
This paper introduces a robust model repair framework for over-parameterized models, enabling recovery after corruption using only design data, with theoretical insights and empirical validation across various model types.
Contribution
It develops a novel theory for repairing over-parameterized models, highlighting the importance of redundancy and over-parameterization, and demonstrates repair methods for neural networks and other models.
Findings
Over-parameterization is essential for model repair.
SGD-based estimators are effective for repair.
Sparse estimators are generally not repairable.
Abstract
A new type of robust estimation problem is introduced where the goal is to recover a statistical model that has been corrupted after it has been estimated from data. Methods are proposed for "repairing" the model using only the design and not the response values used to fit the model in a supervised learning setting. Theory is developed which reveals that two important ingredients are necessary for model repair---the statistical model must be over-parameterized, and the estimator must incorporate redundancy. In particular, estimators based on stochastic gradient descent are seen to be well suited to model repair, but sparse estimators are not in general repairable. After formulating the problem and establishing a key technical lemma related to robust estimation, a series of results are presented for repair of over-parameterized linear models, random feature models, and artificial neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Machine Learning and Algorithms
MethodsRepair
