The Efficiency Misnomer

Mostafa Dehghani; Anurag Arnab; Lucas Beyer; Ashish Vaswani; and Yi Tay

arXiv:2110.12894·cs.LG·March 17, 2022

The Efficiency Misnomer

Mostafa Dehghani, Anurag Arnab, Lucas Beyer, Ashish Vaswani, and Yi Tay

PDF

Open Access 1 Models 2 Videos

TL;DR

This paper critically examines the common assumptions about model efficiency metrics in machine learning, highlighting inconsistencies and proposing better reporting practices to provide a clearer understanding of models' practical costs.

Contribution

It offers a comprehensive analysis of efficiency metrics, discusses their contradictions, and suggests improvements for reporting to enhance transparency and comparability.

Findings

01

Efficiency metrics often contradict each other.

02

Incomplete reporting leads to partial conclusions.

03

Better reporting practices can improve understanding of model costs.

Abstract

Model efficiency is a critical aspect of developing and deploying machine learning models. Inference time and latency directly affect the user experience, and some applications have hard requirements. In addition to inference costs, model training also have direct financial and environmental impacts. Although there are numerous well-established metrics (cost indicators) for measuring model efficiency, researchers and practitioners often assume that these metrics are correlated with each other and report only few of them. In this paper, we thoroughly discuss common cost indicators, their advantages and disadvantages, and how they can contradict each other. We demonstrate how incomplete reporting of cost indicators can lead to partial conclusions and a blurred or incomplete picture of the practical considerations of different models. We further present suggestions to improve reporting of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
fcxfcx/owlv2
model· ♡ 1
♡ 1

Videos

The efficiency misnomer | Size does not matter | What does the number of parameters mean in a model?· youtube

The Efficiency Misnomer· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications