How to Do Statistical Evaluations in ECE/CS Papers: A Practical Playbook for Defensible Results
Bhaskar Krishnamachari

TL;DR
This paper provides a practical, example-driven guide for conducting statistically sound evaluations in ECE/CS research, emphasizing a comprehensive workflow and modern techniques.
Contribution
It introduces a structured evaluation workflow with classical and modern statistical methods, tailored for ECE/CS researchers, including Python examples and supplementary materials.
Findings
Clarifies the evaluation process from claim to reporting.
Integrates classical and distribution-free statistical techniques.
Provides practical Python snippets and supplementary educational resources.
Abstract
Strong experimental papers in electrical and computer engineering and computer science (ECE/CS), especially in systems, networking, and applied machine learning, rest on more than a single impressive number. They rest on a chain of design, measurement, analysis, and validation choices that, taken together, make a result believable. This tutorial is a compact, example-driven guide to that chain for beginning researchers. We organize it as an evaluation workflow: claim, hypothesis, unit of analysis, baseline, regime sweep, uncertainty estimate, validation check, and reporting. Within that workflow we cover the classical statistical foundations (descriptive statistics, the central limit theorem, normal- and -based confidence intervals, Student's -test, ANOVA, chi-squared and Pearson correlation, linear regression) alongside the modern, distribution-free techniques (the bootstrap,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
