Transparent Object Tracking with Enhanced Fusion Module

Kalyan Garigapati; Erik Blasch; Jie Wei; Haibin Ling

arXiv:2309.06701·cs.CV·September 14, 2023·1 cites

Transparent Object Tracking with Enhanced Fusion Module

Kalyan Garigapati, Erik Blasch, Jie Wei, Haibin Ling

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel feature fusion module with a transformer encoder and MLP that embeds transparency information into fixed feature spaces, improving transparent object tracking performance.

Contribution

The paper proposes a new fusion technique and training strategy that enable transparency awareness in fixed latent space trackers, expanding their applicability.

Findings

01

Achieves superior results on TOTB benchmark

02

Compatible with pre-trained transformer-based trackers

03

Code and results will be publicly available

Abstract

Accurate tracking of transparent objects, such as glasses, plays a critical role in many robotic tasks such as robot-assisted living. Due to the adaptive and often reflective texture of such objects, traditional tracking algorithms that rely on general-purpose learned features suffer from reduced performance. Recent research has proposed to instill transparency awareness into existing general object trackers by fusing purpose-built features. However, with the existing fusion techniques, the addition of new features causes a change in the latent space making it impossible to incorporate transparency awareness on trackers with fixed latent spaces. For example, many of the current days transformer-based trackers are fully pre-trained and are sensitive to any latent space perturbations. In this paper, we present a new feature fusion technique that integrates transparency information into a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kalyan0510/totem
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Face recognition and analysis