Multimodal Fusion Image Stabilization Algorithm for Bio-Inspired Flapping-Wing Aircraft
Zhikai Wang, Sen Wang, Yiwen Hu, Yangfan Zhou, Na Li, Xiaofeng Zhang

TL;DR
This paper introduces FWStab, a dataset and framework for stabilizing videos from flapping-wing aircraft by combining sensor data and images.
Contribution
The main contribution is a multimodal fusion framework that stabilizes video using IMU data and image features, trained without supervision using a joint loss function.
Findings
The FWStab dataset includes 48 video clips across five typical flight scenarios, with synchronized IMU data for multimodal modeling.
The proposed framework improves inter-frame stability and avoids the visual artifacts that traditional methods introduce.
Using an LSTM and a joint loss function, the framework achieves high-precision trajectory prediction.
Abstract
This paper presents FWStab, a video stabilization dataset tailored to flapping-wing platforms. The dataset covers five typical flight scenarios and comprises 48 video clips with intense dynamic jitter. Inertial Measurement Unit (IMU) data are collected synchronously with the video, and together the two provide reliable support for multimodal modeling. Building on this dataset, and to address the poor image quality caused by severe vibration of the aircraft, the paper proposes a multimodal signal fusion framework for video stabilization. The framework integrates image features and inertial sensor features to predict smooth, stable camera poses. During stabilization, each frame is warped from the true camera motion estimated from the sensors to the smooth trajectory predicted by the network, improving inter-frame stability.…
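The warping step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a pure-rotation camera model, a pinhole intrinsic matrix, and world-to-camera rotation matrices, none of which the summary specifies. The names `rotation_from_rotvec`, `stabilizing_homography`, `warp_point`, and `K_cam` are hypothetical and chosen for illustration only.

```python
import numpy as np

def rotation_from_rotvec(rotvec):
    """Rodrigues formula: axis-angle vector (radians) -> 3x3 rotation matrix."""
    theta = np.linalg.norm(rotvec)
    if theta < 1e-12:
        return np.eye(3)
    k = rotvec / theta
    # Skew-symmetric cross-product matrix of the unit axis.
    S = np.array([[0.0, -k[2], k[1]],
                  [k[2], 0.0, -k[0]],
                  [-k[1], k[0], 0.0]])
    return np.eye(3) + np.sin(theta) * S + (1.0 - np.cos(theta)) * (S @ S)

def stabilizing_homography(K_cam, R_real, R_smooth):
    """Homography taking pixels of the captured frame to the stabilized frame.

    Assumption: R_real and R_smooth map world directions to camera
    coordinates, and the scene is far enough that translation is negligible.
    """
    return K_cam @ R_smooth @ R_real.T @ np.linalg.inv(K_cam)

def warp_point(H, pixel):
    """Apply a homography to one (x, y) pixel and dehomogenize."""
    p = H @ np.array([pixel[0], pixel[1], 1.0])
    return p[:2] / p[2]

# Illustrative values only: intrinsics and poses are made up.
K_cam = np.array([[800.0, 0.0, 320.0],
                  [0.0, 800.0, 240.0],
                  [0.0, 0.0, 1.0]])
R_real = rotation_from_rotvec(np.array([0.02, -0.05, 0.01]))   # jittery pose from the sensors
R_smooth = rotation_from_rotvec(np.array([0.0, -0.01, 0.0]))   # pose a smoothing network might predict
H = stabilizing_homography(K_cam, R_real, R_smooth)
```

In practice the homography would be applied to the whole frame (for example with an image-warping routine) rather than to single points, and a rolling-shutter camera would need a per-row correction that this sketch omits.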
Taxonomy
Topics: Image and Video Stabilization · Advanced Vision and Imaging · Image Processing Techniques and Applications
