We Have So Much In Common: Modeling Semantic Relational Set Abstractions   in Videos

Alex Andonian; Camilo Fosco; Mathew Monfort; Allen Lee; Rogerio Feris,; Carl Vondrick; and Aude Oliva

arXiv:2008.05596·cs.CV·August 14, 2020

We Have So Much In Common: Modeling Semantic Relational Set Abstractions in Videos

Alex Andonian, Camilo Fosco, Mathew Monfort, Allen Lee, Rogerio Feris,, Carl Vondrick, and Aude Oliva

PDF

1 Repo

TL;DR

This paper introduces a method for learning semantic relational set abstractions in videos by combining visual features with natural language supervision, enabling cognitive tasks like set abstraction, completion, and odd one out detection.

Contribution

It presents a novel approach that explicitly learns relational abstractions with semantic supervision for improved video set understanding.

Findings

01

Significant improvements over baseline algorithms.

02

Robust and versatile representations emerge from learning commonalities.

03

Effective in tasks like set abstraction, completion, and odd one out detection.

Abstract

Identifying common patterns among events is a key ability in human and machine perception, as it underlies intelligent decision making. We propose an approach for learning semantic relational set abstractions on videos, inspired by human learning. We combine visual features with natural language supervision to generate high-level representations of similarities across a set of videos. This allows our model to perform cognitive tasks such as set abstraction (which general concept is in common among a set of videos?), set completion (which new video goes well with the set?), and odd one out detection (which video does not belong to the set?). Experiments on two video benchmarks, Kinetics and Multi-Moments in Time, show that robust and versatile representations emerge when learning to recognize commonalities among sets. We compare our model to several baseline algorithms and show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alexandonian/relational-set-abstraction
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.