Few-Shot LoRA Adaptation of a Flow-Matching Foundation Model for Cross-Spectral Object Detection
Maxim Clouser, Kia Khezeli, John Kalantari

TL;DR
This paper demonstrates that a flow-matching foundation model pre-trained on RGB images can be efficiently adapted with few-shot learning to generate cross-spectral images, improving object detection in infrared and SAR modalities.
Contribution
The study introduces a novel few-shot LoRA adaptation method for flow-matching models, enabling cross-spectral translation with minimal data and enhancing detection performance in non-visible modalities.
Findings
LoRA hyperparameters correlate with downstream detection performance.
Synthetic IR and SAR data improve pedestrian and infrastructure detection.
LPIPS metric predicts the effectiveness of the adapted models.
Abstract
Foundation models for vision are predominantly trained on RGB data, while many safety-critical applications rely on non-visible modalities such as infrared (IR) and synthetic aperture radar (SAR). We study whether a single flow-matching foundation model pre-trained primarily on RGB images can be repurposed as a cross-spectral translator using only a few co-measured examples, and whether the resulting synthetic data can enhance downstream detection. Starting from FLUX.1 Kontext, we insert low-rank adaptation (LoRA) modules and fine-tune them on just 100 paired images per domain for two settings: RGB to IR on the KAIST dataset and RGB to SAR on the M4-SAR dataset. The adapted model translates RGB images into pixel-aligned IR/SAR, enabling us to reuse existing bounding boxes and train object detection models purely in the target modality. Across a grid of LoRA hyperparameters, we find that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced SAR Imaging Techniques
