# Improving Robustness of Heterogeneous Serverless Computing Systems Via Probabilistic Task Pruning

**Authors:** Chavit Denninnart, James Gentry, Mohsen Amini Salehi

arXiv: 1905.04456 · 2019-05-14

## TL;DR

This paper introduces a probabilistic task pruning method for heterogeneous serverless computing systems. By dropping tasks that are unlikely to meet their deadlines, it improves system robustness by up to 35% under execution-time uncertainty and oversubscription.

## Contribution

The paper proposes a task pruning mechanism that plugs into existing task-mapping heuristics without altering them, improving system robustness without redesigning the scheduling approach.
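The idea of adding pruning to a heuristic without altering it can be pictured as a wrapper that filters the task list before the heuristic sees it. The following is a minimal sketch under stated assumptions, not the authors' implementation: `Task`, `with_pruning`, the toy `deadline_order_heuristic`, the precomputed `success_prob` field, and the threshold value are all hypothetical names and choices made for illustration.

```python
from typing import Callable, Dict, List, NamedTuple


class Task(NamedTuple):
    name: str
    deadline: float
    # Assumption: the chance of meeting the deadline was estimated elsewhere.
    success_prob: float


# A mapping heuristic takes tasks and returns a task-name -> machine-name map.
Heuristic = Callable[[List[Task]], Dict[str, str]]


def with_pruning(heuristic: Heuristic, threshold: float) -> Heuristic:
    """Wrap an unmodified heuristic so low-probability tasks are dropped first."""

    def wrapped(tasks: List[Task]) -> Dict[str, str]:
        kept = [t for t in tasks if t.success_prob >= threshold]
        return heuristic(kept)

    return wrapped


def deadline_order_heuristic(tasks: List[Task]) -> Dict[str, str]:
    """Toy stand-in heuristic: sort by deadline, alternate between two machines."""
    machines = ["m0", "m1"]
    ordered = sorted(tasks, key=lambda t: t.deadline)
    return {t.name: machines[i % len(machines)] for i, t in enumerate(ordered)}


tasks = [
    Task("a", deadline=5.0, success_prob=0.9),
    Task("b", deadline=3.0, success_prob=0.1),  # unlikely to finish in time
    Task("c", deadline=4.0, success_prob=0.6),
]
pruned_heuristic = with_pruning(deadline_order_heuristic, threshold=0.5)
mapping = pruned_heuristic(tasks)
```

The wrapped heuristic keeps its original signature, so any caller that used the plain heuristic can switch to the pruned version without other changes.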

## Key findings

- Up to 35% improvement in system robustness.
- Effective in both heterogeneous and homogeneous systems.
- Enhances QoS by reducing deadline misses.

## Abstract

Cloud-based serverless computing is an increasingly popular computing paradigm. In this paradigm, different services have diverse computing requirements that justify deploying an inconsistently Heterogeneous Computing (HC) system to efficiently process them. In an inconsistently HC system, each task needed for a given service potentially exhibits different execution times on each type of machine. An ideal resource allocation system must be aware of such uncertainties in execution times and be robust against them, so that Quality of Service (QoS) requirements of users are met. This research aims to maximize the robustness of an HC system utilized to offer a serverless computing system, particularly when the system is oversubscribed. Our strategy to maximize robustness is to develop a task pruning mechanism that can be added to existing task-mapping heuristics without altering them. Pruning tasks with a low probability of meeting their deadlines improves the likelihood of other tasks meeting their deadlines, thereby increasing system robustness and overall QoS. To evaluate the impact of the pruning mechanism, we examine it on various configurations of heterogeneous and homogeneous computing systems. Evaluation results indicate a considerable improvement (up to 35%) in the system robustness.
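To make "low probability of meeting their deadlines" concrete, the sketch below estimates that probability for one task on each machine type and prunes the task when even its best option falls under a threshold. This is an illustration only: modelling execution time as a normal distribution, the function names `prob_meets_deadline` and `should_prune`, and every number used are assumptions of the sketch, since this summary does not state how the paper models execution-time uncertainty.

```python
import math
from typing import Dict, Tuple


def prob_meets_deadline(start: float, mean_exec: float, std_exec: float,
                        deadline: float) -> float:
    """P(start + execution_time <= deadline), assuming normal execution time."""
    slack = deadline - start
    if std_exec == 0.0:
        # Deterministic execution time: the task either fits or it does not.
        return 1.0 if mean_exec <= slack else 0.0
    z = (slack - mean_exec) / (std_exec * math.sqrt(2.0))
    return 0.5 * (1.0 + math.erf(z))  # normal CDF evaluated at the slack


def should_prune(ready_times: Dict[str, float],
                 exec_stats: Dict[str, Tuple[float, float]],
                 deadline: float, threshold: float) -> bool:
    """Prune when the task's best chance across machine types is below threshold."""
    best = max(
        prob_meets_deadline(ready_times[m], mean, std, deadline)
        for m, (mean, std) in exec_stats.items()
    )
    return best < threshold


# Hypothetical task: (mean, std) of execution time differs per machine type,
# mirroring the inconsistent heterogeneity described in the abstract.
exec_stats = {"cpu": (10.0, 2.0), "gpu": (4.0, 1.0)}
ready_times = {"cpu": 0.0, "gpu": 0.0}  # when each machine could start the task

tight = should_prune(ready_times, exec_stats, deadline=2.0, threshold=0.25)
loose = should_prune(ready_times, exec_stats, deadline=6.0, threshold=0.25)
```

With the tight deadline, neither machine type gives the task a realistic chance, so it is pruned and its capacity is freed for other tasks; with the looser deadline, the faster machine type makes success likely and the task is kept.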

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.04456/full.md

## Figures

19 figures with captions in the complete paper: https://tomesphere.com/paper/1905.04456/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/1905.04456/full.md

---
Source: https://tomesphere.com/paper/1905.04456