High Fidelity Interactive Video Segmentation Using Tensor Decomposition   Boundary Loss Convolutional Tessellations and Context Aware Skip Connections

Anthony D. Rhodes; Manan Goel

arXiv:2011.11602·cs.CV·November 24, 2020

High Fidelity Interactive Video Segmentation Using Tensor Decomposition Boundary Loss Convolutional Tessellations and Context Aware Skip Connections

Anthony D. Rhodes, Manan Goel

PDF

TL;DR

HyperSeg is a high fidelity interactive video segmentation algorithm that maintains high resolution features and improves accuracy using tensor decomposition, tessellation, and boundary loss, suitable for VFX and medical imaging.

Contribution

The paper introduces HyperSeg, a novel high-resolution video segmentation method employing tensor decomposition, tessellation, and boundary loss for improved fidelity and temporal coherence.

Findings

01

Demonstrates superior accuracy over baseline models on high-resolution video data.

02

Introduces the VFX Segmentation Dataset with over 27,000 annotated frames.

03

Achieves high fidelity segmentation without downsampling or pooling.

Abstract

We provide a high fidelity deep learning algorithm (HyperSeg) for interactive video segmentation tasks using a convolutional network with context-aware skip connections, and compressed, hypercolumn image features combined with a convolutional tessellation procedure. In order to maintain high output fidelity, our model crucially processes and renders all image features in high resolution, without utilizing downsampling or pooling procedures. We maintain this consistent, high grade fidelity efficiently in our model chiefly through two means: (1) We use a statistically-principled tensor decomposition procedure to modulate the number of hypercolumn features and (2) We render these features in their native resolution using a convolutional tessellation technique. For improved pixel level segmentation results, we introduce a boundary loss function; for improved temporal coherence in video…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.