UniStitch: Unifying Semantic and Geometric Features for Image Stitching

Yuan Mei; Lang Nie; Kang Liao; Yunqiu Xu; Chunyu Lin; Bin Xiao

arXiv:2603.10568·cs.CV·March 24, 2026

UniStitch: Unifying Semantic and Geometric Features for Image Stitching

Yuan Mei, Lang Nie, Kang Liao, Yunqiu Xu, Chunyu Lin, Bin Xiao

PDF

Open Access 1 Models 1 Datasets

TL;DR

UniStitch introduces a novel framework that unifies semantic and geometric features for improved image stitching, leveraging neural modules to combine discrete keypoints with continuous semantic maps, resulting in superior performance over existing methods.

Contribution

The paper proposes a pioneering unified framework that combines semantic and geometric features using neural modules, bridging traditional and learning-based image stitching approaches.

Findings

01

Outperforms state-of-the-art methods significantly

02

Handles complex scenes effectively

03

Demonstrates the benefit of multimodal feature fusion

Abstract

Traditional image stitching methods estimate warps from hand-crafted geometric features, whereas recent learning-based solutions leverage semantic features from neural networks instead. These two lines of research have largely diverged along separate evolution, with virtually no meaningful convergence to date. In this paper, we take a pioneering step to bridge this gap by unifying semantic and geometric features with UniStitch, a unified image stitching framework from multimodal features. To align discrete geometric features (i.e., keypoint) with continuous semantic feature maps, we present a Neural Point Transformer (NPT) module, which transforms unordered, sparse 1D geometric keypoints into ordered, dense 2D semantic maps. Then, to integrate the advantages of both representations, an Adaptive Mixture of Experts (AMoE) module is designed to fuse geometric and semantic representations.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
Y5Y/UniStitch_model
model

Datasets

Y5Y/Unistitch_Dataset
dataset· 19 dl
19 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Face recognition and analysis · Multimodal Machine Learning Applications