Semantics-Driven Cloud-Edge Collaborative Inference
Yuche Gao, Beibei Zhang

TL;DR
This paper introduces a semantics-driven cloud-edge collaborative framework for video inference in smart city applications, significantly improving speed, throughput, and traffic efficiency by partitioning tasks between edge and cloud based on semantics.
Contribution
It presents a novel semantics-driven partitioning approach for cloud-edge collaboration in video analytics, reducing latency and traffic while enhancing throughput.
Findings
Up to 5x faster inference speed
Up to 9 FPS throughput
50% reduction in traffic volume
Abstract
With the proliferation of video data in smart city applications like intelligent transportation, efficient video analytics has become crucial but also challenging. This paper proposes a semantics-driven cloud-edge collaborative approach for accelerating video inference, using license plate recognition as a case study. The method separates semantics extraction and recognition, allowing edge servers to only extract visual semantics (license plate patches) from video frames and offload computation-intensive recognition to the cloud or neighboring edges based on load. This segmented processing coupled with a load-aware work distribution strategy aims to reduce end-to-end latency and improve throughput. Experiments demonstrate significant improvements in end-to-end inference speed (up to 5x faster), throughput (up to 9 FPS), and reduced traffic volumes (50% less) compared to cloud-only or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Vehicle License Plate Recognition · Automated Road and Building Extraction
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
