# BubbleNets: Learning to Select the Guidance Frame in Video Object   Segmentation by Deep Sorting Frames

**Authors:** Brent A. Griffin, Jason J. Corso

arXiv: 1903.11779 · 2020-11-25

## TL;DR

This paper introduces BubbleNets, a deep learning approach that intelligently selects the optimal frame for annotation in video object segmentation, significantly improving performance without altering existing segmentation methods.

## Contribution

The paper proposes BubbleNets, a novel deep sorting network that learns to identify the best annotation frame, enhancing segmentation accuracy in videos.

## Key findings

- Achieves 11% relative improvement on DAVIS benchmark
- Effectively leverages existing datasets for training
- Improves segmentation performance by selecting optimal frames

## Abstract

Semi-supervised video object segmentation has made significant progress on real and challenging videos in recent years. The current paradigm for segmentation methods and benchmark datasets is to segment objects in video provided a single annotation in the first frame. However, we find that segmentation performance across the entire video varies dramatically when selecting an alternative frame for annotation. This paper address the problem of learning to suggest the single best frame across the video for user annotation-this is, in fact, never the first frame of video. We achieve this by introducing BubbleNets, a novel deep sorting network that learns to select frames using a performance-based loss function that enables the conversion of expansive amounts of training examples from already existing datasets. Using BubbleNets, we are able to achieve an 11% relative improvement in segmentation performance on the DAVIS benchmark without any changes to the underlying method of segmentation.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.11779/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/1903.11779/full.md

## References

47 references — full list in the complete paper: https://tomesphere.com/paper/1903.11779/full.md

---
Source: https://tomesphere.com/paper/1903.11779