Non-parametric Contextual Relationship Learning for Semantic Video Object Segmentation

Tinghuai Wang; Huiling Wang

arXiv:2407.05916·cs.CV·July 9, 2024

Open Access

TL;DR

This paper introduces a graph-based, non-parametric method for modeling and propagating semantic contextual relationships in videos to improve object segmentation accuracy.

Contribution

It presents a novel exemplar-based, non-parametric approach that encodes relationships on a similarity graph and integrates learned contexts into a CRF for semantic labeling.

Findings

01 Outperforms state-of-the-art methods on the YouTube-Objects dataset

02 Effectively models spatial-temporal contextual relationships

03 Enhances semantic segmentation accuracy

Abstract

We propose a novel approach for modeling semantic contextual relationships in videos. This graph-based model enables the learning and propagation of higher-level spatial-temporal contexts to facilitate the semantic labeling of local regions. We introduce an exemplar-based, non-parametric view of contextual cues, where the inherent relationships implied by object hypotheses are encoded on a similarity graph of regions. Contextual relationship learning and propagation are then performed to estimate the pairwise contexts between all pairs of unlabeled local regions. Our algorithm integrates the learned contexts into a Conditional Random Field (CRF) in the form of pairwise potentials and infers the per-region semantic labels. Evaluation on the challenging YouTube-Objects dataset shows that the proposed contextual relationship model outperforms state-of-the-art methods.
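The abstract's idea of propagating information over a similarity graph of regions can be illustrated with a minimal sketch. This is not the authors' algorithm: it substitutes a standard single-score label propagation (iterating F <- alpha * S F + (1 - alpha) * Y on a symmetrically normalized Gaussian affinity graph) for the paper's pairwise-context propagation, and the function names, the toy features, and the parameters `sigma`, `alpha`, and `iters` are all assumptions made for illustration.

```python
import math

def similarity_graph(features, sigma=1.0):
    """Dense Gaussian-kernel affinity matrix with a zero diagonal (assumed kernel)."""
    n = len(features)
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d2 = sum((a - b) ** 2 for a, b in zip(features[i], features[j]))
            W[i][j] = W[j][i] = math.exp(-d2 / (2.0 * sigma ** 2))
    return W

def normalize(W):
    """Symmetric normalization S = D^{-1/2} W D^{-1/2}."""
    n = len(W)
    deg = [sum(row) for row in W]
    return [
        [W[i][j] / math.sqrt(deg[i] * deg[j]) if deg[i] > 0 and deg[j] > 0 else 0.0
         for j in range(n)]
        for i in range(n)
    ]

def propagate(S, seeds, alpha=0.9, iters=200):
    """Iterate F <- alpha * S F + (1 - alpha) * Y, starting from F = Y."""
    n = len(S)
    F = list(seeds)
    for _ in range(iters):
        F = [
            alpha * sum(S[i][j] * F[j] for j in range(n)) + (1 - alpha) * seeds[i]
            for i in range(n)
        ]
    return F

# Toy example: six hypothetical regions forming two well-separated feature clusters.
features = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
            [5.0, 5.0], [5.1, 5.0], [5.0, 5.1]]
seeds = [1.0, 0.0, 0.0, 0.0, 0.0, 0.0]  # only region 0 carries an initial score
scores = propagate(normalize(similarity_graph(features)), seeds)
```

In this toy setting, the unlabeled regions that share a cluster with the seeded region receive much higher scores than the regions in the distant cluster. In a full pipeline such propagated quantities would feed the potentials of a CRF; that step is omitted here.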

Taxonomy

Topics: Video Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications