TL;DR
This paper introduces rotation-equivariant Siamese networks (RE-SiamNets) that effectively handle in-plane rotations in visual object tracking, outperforming regular CNN-based trackers and providing unsupervised pose change estimation.
Contribution
The paper proposes RE-SiamNets with group-equivariant convolutional layers to improve rotation robustness in tracking and introduces a new Rotation Tracking Benchmark dataset.
Findings
RE-SiamNets outperform regular Siamese trackers on rotation instances
RE-SiamNets accurately estimate relative in-plane rotation in an unsupervised manner
The approach effectively incorporates rotation constraints into tracking
Abstract
Rotation is among the long prevailing, yet still unresolved, hard challenges encountered in visual object tracking. The existing deep learning-based tracking algorithms use regular CNNs that are inherently translation equivariant, but not designed to tackle rotations. In this paper, we first demonstrate that in the presence of rotation instances in videos, the performance of existing trackers is severely affected. To circumvent the adverse effect of rotations, we present rotation-equivariant Siamese networks (RE-SiamNets), built through the use of group-equivariant convolutional layers comprising steerable filters. SiamNets allow estimating the change in orientation of the object in an unsupervised manner, thereby facilitating its use in relative 2D pose estimation as well. We further show that this change in orientation can be used to impose an additional motion constraint in Siamese…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
