Guides and Surveys

Feature Experiment Web Experiment

SDKs APIs

Overview of Amplitude Data Getting set up with Amplitude Data Autocapture Visual Labeling Data structure video walkthrough Debug with the Amplitude Chrome extension Manage your Amplitude Data settings Data backfill

Plan your taxonomy Planning and instrumentation workflow Create a tracking plan Work with branches Ampli and developer tools Monitor your data with observe Protect your schema from unexpected data Integrate Jira with Amplitude Data Manage access to sensitive data with Data Access Control

About user properties and event properties Event and property descriptions Event and property display names Event categorization Set an event's activity status Property types IP address, location, user agent, and device properties Override a property definition Revert an overridden property definition Using property groups Cross-project analysis with Portfolios

Organize your tracking plan with Data Assistant Remove invalid or incorrect data Block bot web traffic Change the description of an event or property Fix your data with transformations Time to Live (TTL)

Custom events Object management Currency Conversion Derived properties Lookup Tables Channels

Warehouse-native Amplitude: Overview Build a warehouse-native data model Warehouse-native Amplitude: Best Practices Warehouse-native Bulk Model Management Warehouse-native DBT Integration

Activation overview

Recommendations: Help users reach the goals you've set for them

Build a recommendation

Use recommendations in personalization campaigns

Predictions: Use Amplitude's AI to help maximize lift

Build a prediction

Sync to third-party destinations

Data Mutability Features

Source catalog

Connect to a source

Understand the data differences between Amplitude, Snowflake, and the Export API

Profiles

Amplitude Wordpress plugin

Amplitude Shopify Plugin

Converter configuration reference

Track sessions

Destination catalog

Connect to a Destination

Destination event streaming overview

Client-side vs Server-side

Streaming transformations

Sync Cohorts with Destinations

Datagrail

Osano

Transcend

/

Warehouse-native Amplitude: Best Practices

Warehouse-native Amplitude: Best Practices

Warehouse-native Amplitude enables you to bring your own models to your analyses. However, to get the most out of your data as quickly as possible, you should consider these best practices:

Clustering key

Choose appropriate columns for clustering keys based on the query patterns and filtering conditions in your analytics workload.
- For example, for event (fact) tables, cluster on event time using the LINEAR() function.
Avoid using columns with high cardinality as clustering keys. This can lead to inefficiencies in data storage and query performance.
Use composite clustering keys with multiple columns often used in join operations, or for filtering.

Schema format

Use a star schema or Snowflake schema to optimize query performance and simplify data analysis.
- Star schema: features a central fact table linked to dimension tables, suitable for simpler queries and faster aggregations.
- Snowflake schema: a normalized version of the star schema, which minimizes data redundancy and improves data integrity at the cost of more complex queries.

Partition and clustering

Partition large tables to reduce the amount of data scanned, and improve query performance.

Was this page helpful?

June 4th, 2024

Need help? Contact Support

Visit Amplitude.com

Have a look at the Amplitude Blog

Learn more at Amplitude Academy

Terms of Service Privacy Notice Acceptable Use Policy Legal

© 2025 Amplitude, Inc. All rights reserved. Amplitude is a registered trademark of Amplitude, Inc.