# Quadruplet Network with One-Shot Learning for Fast Visual Object   Tracking

**Authors:** Xingping Dong, Jianbing Shen, Dongming Wu, Kan Guo and, Xiaogang Jin, Fatih Porikli

arXiv: 1705.07222 · 2019-07-16

## TL;DR

This paper introduces a quadruplet deep network for one-shot visual object tracking, leveraging multi-instance relationships and a novel loss function to improve accuracy and speed in real-time applications.

## Contribution

The paper proposes a quadruplet network with a new loss function and weight layer, enhancing one-shot learning for object tracking beyond traditional pair-based methods.

## Key findings

- Achieves superior tracking performance on benchmarks.
- Operates at 78 frames-per-second in real-time.
- Outperforms existing methods in accuracy and speed.

## Abstract

In the same vein of discriminative one-shot learning, Siamese networks allow recognizing an object from a single exemplar with the same class label. However, they do not take advantage of the underlying structure of the data and the relationship among the multitude of samples as they only rely on pairs of instances for training. In this paper, we propose a new quadruplet deep network to examine the potential connections among the training instances, aiming to achieve a more powerful representation. We design four shared networks that receive multi-tuple of instances as inputs and are connected by a novel loss function consisting of pair-loss and triplet-loss. According to the similarity metric, we select the most similar and the most dissimilar instances as the positive and negative inputs of triplet loss from each multi-tuple. We show that this scheme improves the training performance. Furthermore, we introduce a new weight layer to automatically select suitable combination weights, which will avoid the conflict between triplet and pair loss leading to worse performance. We evaluate our quadruplet framework by model-free tracking-by-detection of objects from a single initial exemplar in several Visual Object Tracking benchmarks. Our extensive experimental analysis demonstrates that our tracker achieves superior performance with a real-time processing speed of 78 frames-per-second (fps).

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1705.07222/full.md

## Figures

18 figures with captions in the complete paper: https://tomesphere.com/paper/1705.07222/full.md

## References

51 references — full list in the complete paper: https://tomesphere.com/paper/1705.07222/full.md

---
Source: https://tomesphere.com/paper/1705.07222