Debug metric spikes and dips

When you notice unexpected spikes or dips in experiment metrics, identify what changed and determine if the data is trustworthy. This guide provides systematic approaches to debug these anomalies.

Check cumulative exposures for traffic changes

Examine the cumulative exposures chart in your experiment dashboard. The chart shows exposure patterns over time and highlights data quality issues.

Orange dots on the cumulative exposures chart mark detected anomalies in exposure traffic. Orange dots can signal:

Sudden increases or decreases in exposure counts.
Changes to experiment configuration.
Instrumentation or tracking issues.

Data quality check failures appear when traffic to the experiment drops significantly. Review these warnings to determine if the spike or dip relates to exposure problems rather than metric changes.

For more details, refer to Interpret the cumulative exposures graph.

Review experiment configuration changes

Check if the experiment's percentage rollout or traffic allocation increased during the period when you noticed the metric spike or dip.

To review recent changes:

Go to the experiments or flags table on your Amplitude homepage.
Sort by Last Modified to see which experiments changed recently.
Check the experiment's configuration history to identify what changed and when.

Use the Experiment Management API or Slack Notifications to retrieve versions of flag configurations during a specific time period. Compare configurations to identify changes that might explain the metric spike or dip.

Check for releases and deployments

Review whether any code releases, app version updates, or feature deployments occurred during the time window of the metric change.

Releases and annotations in your dashboard help identify temporal correlations between deployments and metric changes. If you haven't added annotations yet, add them after you identify the cause to document the incident.

Segment by platform and version

Metric spikes or dips often affect specific platforms or app versions rather than all users. Group your analysis by these default user properties:

App version: New app releases can introduce bugs or behavioral changes.
OS version: Operating system updates can affect app performance.
Platform: Issues might only affect iOS, Android, or web users.
Device type: Specific device models or screen sizes might behave differently.

Amplitude SDKs track these properties automatically, making them reliable dimensions for debugging. Refer to user properties for more information.

To analyze by these dimensions:

In your experiment results, add a group-by clause for the relevant property.
Look for spikes or dips that affect only specific segments.
Investigate what's unique about the affected segment.

Identify and filter outliers

Outlier users or data quality issues cause some metric spikes:

Bot traffic

Automated bot traffic can skew metrics significantly. Amplitude provides several ways to handle bots:

Use bot traffic blocking to filter known bots automatically.
Configure block filters and drop filters to exclude specific traffic patterns.

Instrumentation issues

Rapid-fire events from a single user can indicate instrumentation bugs:

Components that re-render frequently might send events on each render.
Events might need debouncing to prevent duplicate tracking.
Event handlers might fire multiple times unintentionally.

Use the frequency chart to identify users with unusually high event counts. Investigate these outliers, which can distort your metrics.

To filter outliers from your analysis:

Create a frequency distribution of your metric.
Identify users with extreme values.
Investigate whether these users represent genuine behavior or data quality issues.
Apply filters to exclude confirmed outliers from your experiment results.

Analyze exposure events by flag key

Create an event segmentation chart to identify spikes in exposure events for specific flag keys:

Go to Analytics > Event Segmentation.
Select the [Experiment] Exposure event.
Group by the flag_key property.
Sort by Row Change to see which flags had the largest exposure changes.

This analysis can reveal if a specific experiment suddenly received more traffic, which might explain metric changes.

Use Root Cause Analysis

Root Cause Analysis identifies which user segments contributed most to a metric change. You can't filter exclusively by experiment properties, but you can:

Go to the metric that showed a spike or dip.
Use Root Cause Analysis to identify contributing factors.
Look for patterns in user properties, behaviors, or segments.

Resolve and prevent recurrence

After identifying the cause of a metric spike or dip:

Document your findings with annotations in your dashboard.
If a bug caused the issue, fix the instrumentation.
If bots or outliers caused the issue, configure filters to prevent future occurrences.
If a configuration change caused the issue, restart the experiment with stable settings.
Review sample ratio mismatch troubleshooting if the spike or dip correlates with exposure distribution changes.

Was this helpful?