Quality at the Tail of Machine Learning Inference

Zhengxin Yang; Wanling Gao; Chunjie Luo; Lei Wang; Fei; Tang; Xu Wen; Jianfeng Zhan

arXiv:2212.13925·cs.LG·February 27, 2024

Quality at the Tail of Machine Learning Inference

Zhengxin Yang, Wanling Gao, Chunjie Luo, Lei Wang, Fei, Tang, Xu Wen, Jianfeng Zhan

PDF

Open Access

TL;DR

This paper introduces the concept of 'tail quality' to evaluate fluctuations in deep learning inference quality under time constraints, proposing a framework to predict quality variations in safety-critical applications.

Contribution

It uncovers the phenomenon of inference quality fluctuations due to inference time and proposes a new evaluation framework to analyze and predict these fluctuations.

Findings

01

Inference quality fluctuates with inference time.

02

The proposed framework effectively predicts quality distribution.

03

Experiments validate the framework across multiple models and systems.

Abstract

Machine learning inference should be subject to stringent inference time constraints while ensuring high inference quality, especially in safety-critical (e.g., autonomous driving) and mission-critical (e.g., emotion recognition) contexts. Neglecting either aspect can lead to severe consequences, such as loss of life and property damage. Many studies lack a comprehensive consideration of these metrics, leading to incomplete or misleading evaluations. The study unveils a counterintuitive revelation: deep learning inference quality exhibits fluctuations due to inference time. To depict this phenomenon, the authors coin a new term, "tail quality," providing a more comprehensive evaluation, and overcoming conventional metric limitations. Moreover, the research proposes an initial evaluation framework to analyze factors affecting quality fluctuations, facilitating the prediction of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis