Performance and caching

Amplitude Experiment evaluation supports two modes, local and remote, each with different performance metrics and tradeoffs.

Performance

Evaluation performance depends on the type of evaluation you use, and from where the request originates.

Amplitude hosts data centers in the US and EU: us-west-2 and eu-central-1, respectively. You will experience higher latency if you make requests to data centers geographically distant from the data center that directly hosts your Amplitude data.

Remote evaluation

Remote evaluation uses Fastly to cache evaluation results for a user. Cache hits serve variants from the edge, directly improving performance.

The following results are synthetic remote evaluation test requests to Amplitude's US data center collected over the last six (6) months. Latency includes Domain Name System (DNS) resolution, transport layer security (TLS) connection as well as the remote evaluation request response round trip.

Cache Average
HIT 35.9ms
MISS 194.21ms

Cache Average
HIT 48.21ms
MISS 100.56ms

Cache Average
HIT 18.18ms
MISS 262.62ms

Cache Average
HIT 31.96ms
MISS 214.3ms

Cache Average
HIT 28.84ms
MISS 239.09ms

Local evaluation

Local evaluation pre-fetches flag configurations which are then used to evaluate all users, saving a network request and speeding up evaluation compared to remote evaluation.

The following results are for a single flag evaluation, collected over 10 executions of 10,000 iterations of evaluation with randomized user inputs evaluated for 1 flag configuration. These results were selected at random out of three (3) possible flag configurations.

SDK Average Median Cold Start
Node.js 0.025ms 0.018ms 3ms
Go 0.098ms 0.071ms 0.7ms
JVM 0.007ms 0.005ms 6ms

CDN caching

Content delivery network

  • A Conten Delivery Network (CDN)) refers to a geographically distributed group of servers that work together to provide fast delivery of Internet content.
  • The CDN caches variant responses for remote evaluation and flag configurations for local evaluation.

After Experiment computes and retrieves a response for a request, it caches that request for reuse to make future requests faster. Experiment uses a CDN to cache the experiments and feature flags for a user for low latency access on subsequent requests.

Cache time-to-live (TTL)

Experiment caches requests to the server on the CDN. It's a TTL (time-to-live) cache and expires after 60 minutes independent of whether or not the key is accessed within that 60 minute period. The 60 minutes cache time window starts from the first page load.

Cache key

The CDN caches the exact request received, including user information. Any change in user info always misses the CDN cache (unless that exact same request was cached before).

Cache invalidation

To make sure you don't receive stale results when your underlying flags have changed, Experiment invalidates (deletes) cached results for an entire deployment whenever a flag or experiment associated with that deployment is updated. As the SDKs retrieves results for all experiments and feature flags for a given deployment for a user, Experiment invalidates all results for a given deployment every time there's a change in even a single flag associated with a deployment. Experiment also invalidates all requests cached for a deployment every time the deployment is added to a flag or removed from a flag.

Dynamic targeting cache considerations

Experiment lets you target feature-flags and experiments based on dynamic properties (such as user properties and behavioral cohorts) synced from Analytics. Because these properties aren't included in the fetch request, you may receive cached experiment results for up to one (1) hour (the TTL) until the cache misses and the user is re-evaluated with the most recent dynamic properties.

Amplitude user properties

Experiment's remote evaluation servers can target based on user properties previously identified with the user. Because the CDN caches responses based only on user properties passed explicitly in the request, the caller may still receive stale results for up to one (1) hour, even if the user properties in Analytics are updated and would cause the user to be evaluated into a different variant.

Behavioral cohorts

You may want to use behavioral cohorts defined in Amplitude Analytics in your flag and experiment targeting. Because experiment cohorts are computed hourly and the CDN cache TTL is also hourly, a user may be delayed from being targeted to a variant for up-to two (2) hours.

Tip

Amplitude Experiment recommends only using dynamic cohort targeting for flags and experiments where the inclusion in a variant of a flag isn't time sensitive.

Was this page helpful?

May 28th, 2024

Need help? Contact Support

Visit Amplitude.com

Have a look at the Amplitude Blog

Learn more at Amplitude Academy

© 2025 Amplitude, Inc. All rights reserved. Amplitude is a registered trademark of Amplitude, Inc.