SafeTail: Efficient Tail Latency Optimization in Edge Service Scheduling   via Computational Redundancy Management

Jyoti Shokhanda; Utkarsh Pal; Aman Kumar; Soumi Chattopadhyay; Arani; Bhattacharya

arXiv:2408.17171·cs.LG·October 23, 2024

SafeTail: Efficient Tail Latency Optimization in Edge Service Scheduling via Computational Redundancy Management

Jyoti Shokhanda, Utkarsh Pal, Aman Kumar, Soumi Chattopadhyay, Arani, Bhattacharya

PDF

Open Access

TL;DR

SafeTail is a novel framework that uses deep learning to optimize tail latency in edge computing by selectively replicating services, effectively balancing latency targets and resource efficiency under uncertain network conditions.

Contribution

It introduces a reward-based deep learning approach for adaptive service replication to optimize tail latency in edge environments, addressing limitations of prior fixed redundancy methods.

Findings

01

SafeTail achieves near-optimal tail latency performance.

02

Outperforms baseline strategies in diverse service scenarios.

03

Reduces resource usage while maintaining latency targets.

Abstract

Optimizing tail latency while efficiently managing computational resources is crucial for delivering high-performance, latency-sensitive services in edge computing. Emerging applications, such as augmented reality, require low-latency computing services with high reliability on user devices, which often have limited computational capabilities. Consequently, these devices depend on nearby edge servers for processing. However, inherent uncertainties in network and computation latencies stemming from variability in wireless networks and fluctuating server loads make service delivery on time challenging. Existing approaches often focus on optimizing median latency but fall short of addressing the specific challenges of tail latency in edge environments, particularly under uncertain network and computational conditions. Although some methods do address tail latency, they typically rely on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIoT and Edge/Fog Computing · Cloud Computing and Resource Management

Methodstravel james · Focus