CRSOT: Cross-Resolution Object Tracking using Unaligned Frame and Event Cameras
Yabin Zhu, Xiao Wang, Chenglong Li, Bo Jiang, Lin Zhu, Zhixiang Huang,, Yonghong Tian, Jin Tang

TL;DR
This paper introduces CRSOT, a novel framework for object tracking using unaligned high-definition neuromorphic and visible cameras, supported by a new dataset and a unified ViT-based tracking approach.
Contribution
The paper presents the first unaligned frame-event dataset CRSOT and a robust tracking framework that effectively fuses RGB and event data without strict alignment.
Findings
Effective tracking with unaligned RGB and event data
High performance on the CRSOT dataset
Framework generalizes well to different sensor resolutions
Abstract
Existing datasets for RGB-DVS tracking are collected with DVS346 camera and their resolution () is low for practical applications. Actually, only visible cameras are deployed in many practical systems, and the newly designed neuromorphic cameras may have different resolutions. The latest neuromorphic sensors can output high-definition event streams, but it is very difficult to achieve strict alignment between events and frames on both spatial and temporal views. Therefore, how to achieve accurate tracking with unaligned neuromorphic and visible sensors is a valuable but unresearched problem. In this work, we formally propose the task of object tracking using unaligned neuromorphic and visible cameras. We build the first unaligned frame-event dataset CRSOT collected with a specially built data acquisition system, which contains 1,030 high-definition RGB-Event video pairs,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · CCD and CMOS Imaging Sensors · Neural dynamics and brain function
