A Novel Bounding Box Regression Method for Single Object Tracking

Omar Abdelaziz; Mohamed Sami Shehata

arXiv:2405.10444·cs.CV·May 20, 2024

A Novel Bounding Box Regression Method for Single Object Tracking

Omar Abdelaziz, Mohamed Sami Shehata

PDF

Open Access

TL;DR

This paper introduces two novel bounding box regression networks, inception and deformable, demonstrating their effectiveness in improving single object tracking accuracy across multiple benchmarks.

Contribution

The work highlights the importance of receptive field design in bounding box regression and proposes two new networks that outperform existing methods.

Findings

01

Inception module outperforms deformable on three benchmarks.

02

Receptive field size significantly impacts bounding box accuracy.

03

Proposed methods improve tracking performance over state-of-the-art.

Abstract

Locating an object in a sequence of frames, given its appearance in the first frame of the sequence, is a hard problem that involves many stages. Usually, state-of-the-art methods focus on bringing novel ideas in the visual encoding or relational modelling phases. However, in this work, we show that bounding box regression from learned joint search and template features is of high importance as well. While previous methods relied heavily on well-learned features representing interactions between search and template, we hypothesize that the receptive field of the input convolutional bounding box network plays an important role in accurately determining the object location. To this end, we introduce two novel bounding box regression networks: inception and deformable. Experiments and ablation studies show that our inception module installed on the recent ODTrack outperforms the latter on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Measurement and Detection Methods

MethodsConvolution · 1x1 Convolution · Max Pooling · Focus · Inception Module