# A/B Testing Measurement Framework for Recommendation Models Based on   Expected Revenue

**Authors:** Meisam Hejazinia, Majid Hosseini, Bryant Sih

arXiv: 1906.06390 · 2019-06-18

## TL;DR

This paper introduces a statistical framework for evaluating whether new recommendation systems improve revenue per visit by analyzing conversion rate and order value, using tests that require smaller samples and provide confidence intervals.

## Contribution

It presents a novel A/B testing method for recommendation models that efficiently detects revenue improvements with smaller samples and no subjective outlier removal.

## Key findings

- Method requires fewer samples than existing approaches.
- Assumes log-normal distribution for purchase values, validated empirically.
- Provides confidence intervals to quantify uncertainty.

## Abstract

We provide a method to determine whether a new recommendation system improves the revenue per visit (RPV) compared to the status quo. We achieve our goal by splitting RPV into conversion rate and average order value (AOV). We use the two-part test suggested by Lachenbruch to determine if the data generating process in the new system is different. In cases that this test does not give us a definitive answer about the change in RPV, we propose two alternative tests to determine if RPV has changed. Both of these tests rely on the assumption that non-zero purchase values follow a log-normal distribution. We empirically validate this assumption using data collected at different points in time from Staples.com. On average, our method needs a smaller sample size than other methods. Furthermore, it does not require any subjective outlier removal. Finally, it characterizes the uncertainty around RPV by providing a confidence interval.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.06390/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1906.06390/full.md

## References

18 references — full list in the complete paper: https://tomesphere.com/paper/1906.06390/full.md

---
Source: https://tomesphere.com/paper/1906.06390