Curvature Tuning: Provable Training-free Model Steering From a Single Parameter
Leyang Hu, Matteo Gamba, Randall Balestriero

TL;DR
Curvature Tuning (CT) is a training-free, interpretable method that adjusts model decision boundaries by modifying activation functions, leading to improved accuracy and robustness with minimal hyperparameters.
Contribution
This paper introduces Curvature Tuning, a novel approach that shifts focus from weight adaptation to activation function modulation for model steering, with provable effects and high parameter efficiency.
Findings
CT improves downstream accuracy by over 8% on ResNet models.
CT enhances robustness significantly on $\, ext{l}_ ext{infty}$ benchmarks.
Hyperparameter tuning in CT yields better generalization and robustness.
Abstract
The scaling of model and data sizes has reshaped the AI landscape, establishing finetuning pretrained models as the standard paradigm for solving downstream tasks. However, dominant finetuning methods typically rely on weight adaptation, often lack interpretability, and depend on heuristically chosen hyperparameters. In this paper, we take a different perspective and shift the focus from weights to activation functions, viewing them through the lens of spline operators. We propose Curvature Tuning (CT), an interpretable and principled steering method that modulates a model's decision boundary by injecting a single hyperparameter into its activation functions. We show that CT provably adjusts model decision boundary curvature and, more fundamentally, projects a model onto a space of smooth functions-thereby complementing current finetuning methods, whose effect lies primarily in feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsSoft Robotics and Applications
