TL;DR
MC-RFM introduces a geometry-aware framework for few-shot adaptation of frozen vision models, leveraging mixed-curvature manifolds to better model feature displacements and improve performance across benchmarks.
Contribution
It proposes a novel mixed-curvature Riemannian flow-matching approach that explicitly models feature geometry for more effective few-shot adaptation.
Findings
Outperforms existing methods on multiple benchmarks and backbones.
Strongest gains observed on Transformer backbones and fine-grained datasets.
Ablation studies confirm the importance of geometry-aware components.
Abstract
Parameter-efficient adaptation of pretrained vision models is commonly performed through linear probes, prompts, low-rank updates, or lightweight residual modules. While effective, these methods usually treat adaptation as a discrete Euclidean perturbation of frozen representations, without explicitly modeling the geometry of the task-induced feature displacement. We propose \textsc{MC-RFM}, a mixed-curvature Riemannian flow-matching framework for few-shot adaptation of frozen visual backbones. The key idea is to represent adapted features on a product manifold combining a hyperbolic factor, which captures hierarchy-sensitive semantic structure, and a Euclidean factor, which preserves locally discriminative visual variation. Adaptation is formulated as a task-conditioned continuous transport from frozen features to support-set prototypes, trained with a flow-matching objective and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
