Ethosight: A Reasoning-Guided Iterative Learning System for Nuanced   Perception based on Joint-Embedding & Contextual Label Affinity

Hugo Latapie; Shan Yu; Patrick Hammer; Kristinn R. Thorisson; Vahagn; Petrosyan; Brandon Kynoch; Alind Khare; Payman Behnam; Alexey Tumanov,; Aksheit Saxena; Anish Aralikatti; Hanning Chen; Mohsen Imani; Mike Archbold,; Tangrui Li; Pei Wang; Justin Hart

arXiv:2307.10577·cs.CV·August 22, 2023

Ethosight: A Reasoning-Guided Iterative Learning System for Nuanced Perception based on Joint-Embedding & Contextual Label Affinity

Hugo Latapie, Shan Yu, Patrick Hammer, Kristinn R. Thorisson, Vahagn, Petrosyan, Brandon Kynoch, Alind Khare, Payman Behnam, Alexey Tumanov,, Aksheit Saxena, Anish Aralikatti, Hanning Chen, Mohsen Imani, Mike Archbold,, Tangrui Li, Pei Wang, Justin Hart

PDF

Open Access

TL;DR

Ethosight is a zero-shot video analytics system that uses joint embedding and reasoning with ontologies to achieve adaptable, low-cost, and continuous learning in real-world scenarios, reducing reliance on extensive data annotation.

Contribution

This work introduces Ethosight, a novel reasoning-guided iterative learning system that operates effectively on edge devices and supports continuous adaptation without catastrophic forgetting.

Findings

01

Effective in diverse complex use cases

02

Operates on low-cost edge devices

03

Supports continuous learning and adaptation

Abstract

Traditional computer vision models often necessitate extensive data acquisition, annotation, and validation. These models frequently struggle in real-world applications, resulting in high false positive and negative rates, and exhibit poor adaptability to new scenarios, often requiring costly retraining. To address these issues, we present Ethosight, a flexible and adaptable zero-shot video analytics system. Ethosight begins from a clean slate based on user-defined video analytics, specified through natural language or keywords, and leverages joint embedding models and reasoning mechanisms informed by ontologies such as WordNet and ConceptNet. Ethosight operates effectively on low-cost edge devices and supports enhanced runtime adaptation, thereby offering a new approach to continuous learning without catastrophic forgetting. We provide empirical validation of Ethosight's promising…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Machine Learning and Data Classification