This article helps you:
Understand how holdout groups work in Amplitude Experiment
Create, manage, and analyze a holdout group and the experiments in it
Delve deeper into holdout groups with use case examples
Sometimes it can be useful to keep a certain percentage of users from viewing an experiment. This is especially true when measuring the long-term, combined effects of multiple experiments. Statistical significance in one experiment may not reflect the true, cumulative impact of your experiments.
Amplitude Experiment lets you easily exclude users from your experiments by creating a holdout group. Holdout groups are especially useful for measuring the long term impact of your rolled-out variants, and measuring the lift of your experimentation program as a whole.
For more information, see the article on Flag Dependencies.
This feature is available to users on Enterprise plans who have purchased Amplitude Experiment. See the pricing page for more details.
When using holdout groups, there are a few things to keep in mind:
To create a holdout group and add your experiments to it, follow these steps:
If you have existing groups, click Create A New Group, and then select Holdout Group in the drawer.
You can't change the holdout percentage after you create a group. This ensures consistent bucketing, as well as a consistent user experience.
Don't add the same users nor cohorts to both the Include a holdout and Exclude from holdout slots, as the Include a holdout slot determines inclusion.
Manage your holdout groups from the Experiment Groups tab or from within an experiment:
If you are within an experiment that's part of a holdout group, follow these steps:
Analyze your holdout groups using an Experiment Results chart.
To create a pre-populated Experiment Results chart, follow these steps:
Navigate to the Experiments page and open the Experiment Groups tab.
Find the holdout group you want to analyze and click the chart icon.
Click Open in Analytics.
A new Experiment Results chart opens, with the following fields complete:
From here, select the primary metric and start analyzing the impact of your holdout group.
Adding an experiment to multiple holdout groups may limit an experiment's traffic. This is because Experiment evaluates each user for each holdout group they belong to.
For example, imagine two holdout groups:
Since experiment A is part of both holdout groups (1 and 2), it receives the majority of the total traffic:
0.95 * 0.95 = 0.9025 (90.25%)
Instead of adding an experiment to multiple holdout groups, create a single group with all the relevant experiments instead. This allows for a more even distribution of traffic across experiments.
In the example above, you would create just one holdout group containing all three experiments (A, B, and C).
Adding an experiment to a holdout group and a mutual exclusion group can also further limit the amount of traffic to the experiment. Experiment evaluates each user for both the holdout group and the mutual exclusion group.
For example, imagine the following holdout group and mutual exclusion group:
In this scenario, experiment A receives about half of the total traffic:
0.95 * 0.5 = 0.475 (47.5%)
Using holdout groups with mutual exclusion isn't forbidden, but be cautious of the potential traffic limits as you plan and roll out your experiments.
Learn more in this article about mutual exclusion groups.
Thanks for your feedback!
June 27th, 2024
Need help? Contact Support
Visit Amplitude.com
Have a look at the Amplitude Blog
Learn more at Amplitude Academy
© 2024 Amplitude, Inc. All rights reserved. Amplitude is a registered trademark of Amplitude, Inc.