TL;DR
The paper introduces SOR-Track, a novel RGB-Event fusion framework using spatial orthogonal refinement to improve robust object tracking under challenging motion and lighting conditions.
Contribution
It explicitly leverages geometric priors in event streams with orthogonal directional filters for improved multi-modal feature alignment.
Findings
Outperforms existing fusion-based trackers on FE108 benchmark.
Excels in motion blur and low-light scenarios.
Provides a physics-grounded approach to texture rectification.
Abstract
Robust visual object tracking (VOT) remains challenging in high-speed motion scenarios, where conventional RGB sensors suffer from severe motion blur and performance degradation. Event cameras, with microsecond temporal resolution and high dynamic range, provide complementary structural cues that can potentially compensate for these limitations. However, existing RGB-Event fusion methods typically treat event data as dense intensity representations and adopt black-box fusion strategies, failing to explicitly leverage the directional geometric priors inherently encoded in event streams to rectify degraded RGB features. To address this limitation, we propose SOR-Track, a streamlined framework for robust RGB-Event tracking based on Spatial Orthogonal Refinement (SOR). The core SOR module employs a set of orthogonal directional filters that are dynamically guided by local motion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
