Empirical Decision Rules for Improving the Uncertainty Reporting of Small Sample System Usability Scale Scores
Nicholas Clark, Matthew Dabkowski, Patrick Driscoll, Dereck Kennedy,, Ian Kloo, Heidy Shi

TL;DR
This paper proposes empirically-based decision rules using bootstrap and Bayesian methods to improve the accuracy of uncertainty reporting in small sample SUS scores, addressing limitations of traditional confidence intervals.
Contribution
It introduces novel decision rules and an online tool that enhance confidence interval accuracy for small sample SUS assessments, especially with skewed data.
Findings
Bootstrap and Bayesian intervals outperform traditional methods in small samples.
The online application automates improved SUS analysis.
Empirical data demonstrate reduced parameter violations.
Abstract
The System Usability Scale (SUS) is a short, survey-based approach used to determine the usability of a system from an end user perspective once a prototype is available for assessment. Individual scores are gathered using a 10-question survey with the survey results reported in terms of central tendency (sample mean) as an estimate of the system's usability (the SUS study score), and confidence intervals on the sample mean are used to communicate uncertainty levels associated with this point estimate. When the number of individuals surveyed is large, the SUS study scores and accompanying confidence intervals relying upon the central limit theorem for support are appropriate. However, when only a small number of users are surveyed, reliance on the central limit theorem falls short, resulting in confidence intervals that suffer from parameter bound violations and interval widths that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
