FoRA: Low-Rank Adaptation Model beyond Multimodal Siamese Network

Weiying Xie; Yusi Zhang; Tianlin Hui; Jiaqing Zhang; Jie; Lei; Yunsong Li

arXiv:2407.16129·cs.CV·July 24, 2024

FoRA: Low-Rank Adaptation Model beyond Multimodal Siamese Network

Weiying Xie, Yusi Zhang, Tianlin Hui, Jiaqing Zhang, Jie, Lei, Yunsong Li

PDF

1 Repo

TL;DR

FoRA introduces Low-rank Modal Adaptors with a shared backbone and adaptive rank allocation for multimodal object detection, achieving significant accuracy improvements and parameter reduction.

Contribution

The paper proposes a novel low-rank adaptation model with shared backbone and adaptive rank strategy for multimodal detection, addressing complex fusion and parameter issues.

Findings

01

10.4% accuracy improvement on DroneVehicle dataset

02

149M fewer parameters compared to state-of-the-art

03

Effective handling of data heterogeneity at feature levels

Abstract

Multimodal object detection offers a promising prospect to facilitate robust detection in various visual conditions. However, existing two-stream backbone networks are challenged by complex fusion and substantial parameter increments. This is primarily due to large data distribution biases of multimodal homogeneous information. In this paper, we propose a novel multimodal object detector, named Low-rank Modal Adaptors (LMA) with a shared backbone. The shared parameters enhance the consistency of homogeneous information, while lightweight modal adaptors focus on modality unique features. Furthermore, we design an adaptive rank allocation strategy to adapt to the varying heterogeneity at different feature levels. When applied to two multimodal object detection datasets, experiments validate the effectiveness of our method. Notably, on DroneVehicle, LMA attains a 10.4% accuracy improvement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zyszxhy/fora
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus