# Safely and Quickly Deploying New Features with a Staged Rollout   Framework Using Sequential Test and Adaptive Experimental Design

**Authors:** Zhenyu Zhao, Mandie Liu, Anirban Deb

arXiv: 1905.10493 · 2019-05-28

## TL;DR

This paper introduces an automated, adaptive framework for deploying new features safely and efficiently by gradually increasing rollout size based on real-time performance monitoring and decision algorithms.

## Contribution

It proposes a novel automated staged rollout framework using sequential testing and adaptive ramp-up algorithms, enhancing safety and speed of feature deployment.

## Key findings

- Early regression detection reduces negative impact.
- Faster rollout for healthy features.
- Automation decreases manual intervention.

## Abstract

During the rapid development cycle for Internet products (websites and mobile apps), new features are developed and rolled out to users constantly. Features with code defects or design flaws can cause outages and significant degradation of user experience. The traditional method of code review and change management can be time-consuming and error-prone. In order to make the feature rollout process safe and fast, this paper proposes a methodology for rolling out features in an automated way using an adaptive experimental design. Under this framework, a feature is gradually ramped up from a small proportion of users to a larger population based on real-time evaluation of the performance of important metrics. If there are any regression detected during the ramp-up step, the ramp-up process stops and the feature developer is alerted. There are two main algorithm components powering this framework: 1) a continuous monitoring algorithm - using a variant of the sequential probability ratio test (SPRT) to monitor the feature performance metrics and alert feature developers when a metric degradation is detected, 2) an automated ramp-up algorithm - deciding when and how to ramp up to the next stage with larger sample size. This paper presents one monitoring algorithm and three ramping up algorithms including time-based, power-based, and risk-based (a Bayesian approach) schedules. These algorithms are evaluated and compared on both simulated data and real data. There are three benefits provided by this framework for feature rollout: 1) for defective features, it can detect the regression early and reduce negative effect, 2) for healthy features, it rolls out the feature quickly, 3) it reduces the need for manual intervention via the automation of the feature rollout process.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.10493/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1905.10493/full.md

## References

25 references — full list in the complete paper: https://tomesphere.com/paper/1905.10493/full.md

---
Source: https://tomesphere.com/paper/1905.10493