A universal linearized subspace refinement framework for neural networks
Wenbo Cao, Weiwei Zhang

TL;DR
The paper introduces Linearized Subspace Refinement (LSR), a universal framework that improves neural network accuracy by solving a linear residual problem at a fixed network state, surpassing standard training methods without altering architectures.
Contribution
It presents a novel, architecture-agnostic refinement method that leverages linear residual models to enhance neural network predictions beyond traditional gradient-based training.
Findings
LSR achieves order-of-magnitude error reductions in various tasks.
Gradient-based training often fails to reach the attainable accuracy within the model's capacity.
Iterative LSR accelerates convergence and improves accuracy in operator-constrained problems.
Abstract
Neural networks are predominantly trained using gradient-based methods, yet in many applications their final predictions remain far from the accuracy attainable within the model's expressive capacity. We introduce Linearized Subspace Refinement (LSR), a general and architecture-agnostic framework that exploits the Jacobian-induced linear residual model at a fixed trained network state. By solving a reduced direct least-squares problem within this subspace, LSR computes a subspace-optimal solution of the linearized residual model, yielding a refined linear predictor with substantially improved accuracy over standard gradient-trained solutions, without modifying network architectures, loss formulations, or training procedures. Across supervised function approximation, data-driven operator learning, and physics-informed operator fine-tuning, we show that gradient-based training often fails…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Neural Networks and Reservoir Computing
