Motion-Compensated Latent Semantic Canvases for Visual Situational Awareness on Edge

Igor Lodin; Sergii Filatov; Vira Filatova; Dmytro Filatov

arXiv:2601.00854·cs.CV·January 6, 2026

Motion-Compensated Latent Semantic Canvases for Visual Situational Awareness on Edge

Igor Lodin, Sergii Filatov, Vira Filatova, Dmytro Filatov

PDF

Open Access

TL;DR

This paper introduces Motion-Compensated Latent Semantic Canvases (MCLSC), a method for efficient visual situational awareness on edge devices by maintaining persistent semantic data with motion compensation, reducing segmentation calls and processing time.

Contribution

The paper presents a novel approach that combines motion compensation with latent semantic canvases to significantly reduce computational load on resource-constrained devices.

Findings

01

Reduces segmentation calls by over 30 times

02

Lowers processing time by over 20 times

03

Maintains coherent semantic overlays

Abstract

We propose Motion-Compensated Latent Semantic Canvases (MCLSC) for visual situational awareness on resource-constrained edge devices. The core idea is to maintain persistent semantic metadata in two latent canvases - a slowly accumulating static layer and a rapidly updating dynamic layer - defined in a baseline coordinate frame stabilized from the video stream. Expensive panoptic segmentation (Mask2Former) runs asynchronously and is motion-gated: inference is triggered only when motion indicates new information, while stabilization/motion compensation preserves a consistent coordinate system for latent semantic memory. On prerecorded 480p clips, our prototype reduces segmentation calls by >30x and lowers mean end-to-end processing time by >20x compared to naive per-frame segmentation, while maintaining coherent static/dynamic semantic overlays.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Advanced Image and Video Retrieval Techniques