Validating Big Data at Scale

A talk from CEO/co-founder Spenser Skates on the key data challenges that we face collecting data from hundreds of millions of devices

Inside Amplitude
October 2, 2014
Image of datamonster
Data Monster
Mascot of Amplitude
Validating Big Data at Scale

A couple months ago, our co-founder and CEO Spenser had the pleasure of giving a tech talk hosted by our good friends at KeepSafe. He went over some of the key data challenges that we face when we’re simultaneously collecting data from hundreds of millions of devices, including:

  • data getting mangled in transit
  • client sending the same data twice
  • device clocks being inaccurate

Check out the video and slides if you’re a data scientist facing similar problems — or if you’re just interested in how we clean up our customers’ user data!

Screenshot 2018-04-19 17.06.28

Validating big data at scale from Amplitude Analytics

About the Author
Image of datamonster
Data Monster
Mascot of Amplitude
Datamonster spends most of its time nom nomming data and fulfilling duties as a cultural icon and brand ambassador for Amplitude. Datamonster wants everyone to know that there's a little data monster in all of us.