Fast Interactive Video Object Segmentation with Graph Neural Networks

Viktor Varga; Andr\'as L\H{o}rincz

arXiv:2103.03821·cs.CV·April 22, 2021

Fast Interactive Video Object Segmentation with Graph Neural Networks

Viktor Varga, Andr\'as L\H{o}rincz

PDF

1 Repo

TL;DR

This paper introduces a graph neural network approach for interactive video object segmentation that operates on superpixel graphs, achieving state-of-the-art results with fewer parameters and faster inference.

Contribution

The paper presents a novel GNN-based method that reduces problem dimensionality and training data requirements for interactive video segmentation.

Findings

01

Achieves state-of-the-art performance

02

Operates with only a few thousand parameters

03

Fast inference and quick training with little data

Abstract

Pixelwise annotation of image sequences can be very tedious for humans. Interactive video object segmentation aims to utilize automatic methods to speed up the process and reduce the workload of the annotators. Most contemporary approaches rely on deep convolutional networks to collect and process information from human annotations throughout the video. However, such networks contain millions of parameters and need huge amounts of labeled training data to avoid overfitting. Beyond that, label propagation is usually executed as a series of frame-by-frame inference steps, which is difficult to be parallelized and is thus time consuming. In this paper we present a graph neural network based approach for tackling the problem of interactive video object segmentation. Our network operates on superpixel-graphs which allow us to reduce the dimensionality of the problem by several magnitudes. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vvarga90/gnn_annot
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGraph Neural Network