Efficient Semantic Image Communication for Traffic Monitoring at the Edge

Damir Assylbek; Nurmukhammed Aitymbetov; Marko Ristin; Dimitrios Zorbas

arXiv:2604.12622·cs.CV·April 15, 2026

Efficient Semantic Image Communication for Traffic Monitoring at the Edge

Damir Assylbek, Nurmukhammed Aitymbetov, Marko Ristin, Dimitrios Zorbas

PDF

TL;DR

This paper introduces two semantic image communication pipelines, MMSD and SAMR, for traffic monitoring that significantly reduce data transmission while maintaining meaningful visual information using generative models.

Contribution

The paper proposes novel semantic communication methods MMSD and SAMR that enhance compression and privacy in traffic monitoring at the edge with generative reconstruction.

Findings

01

MMSD reduces transmitted data by 99% and preserves semantic consistency.

02

SAMR achieves a 99.1% data reduction with better quality-compression trade-off.

03

Edge processing times are approximately 15s for MMSD and 9s for SAMR on Raspberry Pi 5.

Abstract

Many visual monitoring systems operate under strict communication constraints, where transmitting full-resolution images is impractical and often unnecessary. In such settings, visual data is often used for object presence, spatial relationships, and scene context rather than exact pixel fidelity. This paper presents two semantic image communication pipelines for traffic monitoring, MMSD and SAMR, that reduce transmission cost while preserving meaningful visual information. MMSD (Multi-Modal Semantic Decomposition) targets very high compression together with data confidentiality, since sensitive pixel content is not transmitted. It replaces the original image with compact semantic representations, namely segmentation maps, edge maps, and textual descriptions, and reconstructs the scene at the receiver using a diffusion-based generative model. SAMR (Semantic-Aware Masking Reconstruction)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.