Transparent Object Tracking with Enhanced Fusion Module
Kalyan Garigapati, Erik Blasch, Jie Wei, Haibin Ling

TL;DR
This paper introduces a novel feature fusion module with a transformer encoder and MLP that embeds transparency information into fixed feature spaces, improving transparent object tracking performance.
Contribution
The paper proposes a new fusion technique and training strategy that enable transparency awareness in fixed latent space trackers, expanding their applicability.
Findings
Achieves superior results on TOTB benchmark
Compatible with pre-trained transformer-based trackers
Code and results will be publicly available
Abstract
Accurate tracking of transparent objects, such as glasses, plays a critical role in many robotic tasks such as robot-assisted living. Due to the adaptive and often reflective texture of such objects, traditional tracking algorithms that rely on general-purpose learned features suffer from reduced performance. Recent research has proposed to instill transparency awareness into existing general object trackers by fusing purpose-built features. However, with the existing fusion techniques, the addition of new features causes a change in the latent space making it impossible to incorporate transparency awareness on trackers with fixed latent spaces. For example, many of the current days transformer-based trackers are fully pre-trained and are sensitive to any latent space perturbations. In this paper, we present a new feature fusion technique that integrates transparency information into a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Face recognition and analysis
