Making Data Validation Suck Less

Today, we’re excited to release Schema, the newest addition to Taxonomy, our product add-on to help you expedite data validation and take full control over the data that is collected by Amplitude.

Inside Amplitude
March 27, 2018
Image of Mike Ottavi-Brannon
Mike Ottavi-Brannon
Principal Product Manager
Making Data Validation Suck Less

“Data validation is fun!” – No one ever

Data validation. Two incredibly important words, yet one of the most painful parts of any analytics instrumentation. Whether you’re getting started with product analytics for the first time, or releasing a new feature and making updates to your current implementation, ensuring that the data you collect is accurate can be a nightmare for large organizations and small startups alike.

Today, we’re excited to release Schema, the newest addition to Taxonomy, our product add-on to help you expedite data validation and take full control over the data that is collected by Amplitude.

Instrumentation planning for automated data validation

How often have you instrumented new event data for Amplitude only to hear from someone on your team that it’s incorrect, missing values, or entirely unexpected? With major feature releases, or big taxonomy updates, doing proper validation can take a tremendous amount of time and effort. Mistakes often slip through the cracks and wind up confusing an unsuspecting member of your team with funky-looking data.

We built Schema to solve this problem.

data-validation-taxonomy-schema

Schema helps you better automate your data validation process by letting you build instrumentation plans within the tool. You can also upload a plan you have created outside of Amplitude. As data is collected, Schema will automatically expose the event data that is outside of your plan and notify the appropriate members of your team.

data-validation-taxonomy-schema

Related Reading: The Foundation for Great Analytics is a Great Taxonomy

Schema automatically detects any event, event property, or event property value that you did not intend to ingest. It proactively surfaces those unwanted, unexpected events and properties so you know exactly what your team needs to address when QAing instrumentation updates.

unexpected-events-properties-schema

Ingestion whitelisting for total data governance

Schema adds an additional layer of confidence, integrity and trust to the data you see in Amplitude. Having confidence in your data means you aren’t accidentally tracking sensitive information about your customers.

Related Reading: Analytics that Doesn’t Compromise on Data Integrity

Schema allows you to specify exactly which events, properties, and values you want to ingest into Amplitude so you can avoid accidentally ingesting sensitive data such as personally identifiable information (PII).

Data integrity is at the core of any product analytics solution, yet making sure that what you actually collect is what you intended to collect has been a challenge that eludes some of the most diligent teams. With Schema we hope to make data validation suck a whole lot less and get you to actionable insights faster than ever.

schema-settings-amplitude

Get in touch with us to learn more about how Schema and Taxonomy can help you master your product’s event taxonomy quickly and easily.

About the Author
Image of Mike Ottavi-Brannon
Mike Ottavi-Brannon
Principal Product Manager
Mike is a self-proclaimed amateur chef and Principal Product Manager at Amplitude where he runs the Analytics product group.