Platform

AI

AI Agents
Sense, decide, and act faster than ever before
AI Visibility
See how your brand shows up in AI search
AI Feedback
Distill what your customers say they want
Amplitude MCP
Insights from the comfort of your favorite AI tool

Insights

Product Analytics
Understand the full user journey
Marketing Analytics
Get the metrics you need with one line of code
Session Replay
Visualize sessions based on events in your product
Heatmaps
Visualize clicks, scrolls, and engagement

Action

Guides and Surveys
Guide your users and collect feedback
Feature Experimentation
Innovate with personalized product experiences
Web Experimentation
Drive conversion with A/B testing powered by data
Feature Management
Build fast, target easily, and learn as you ship
Activation
Unite data across teams

Data

Warehouse-native Amplitude
Unlock insights from your data warehouse
Data Governance
Complete data you can trust
Security & Privacy
Keep your data secure and compliant
Integrations
Connect Amplitude to hundreds of partners
Solutions
Solutions that drive business results
Deliver customer value and drive business outcomes
Amplitude Solutions →

Industry

Financial Services
Personalize the banking experience
B2B
Maximize product adoption
Media
Identify impactful content
Healthcare
Simplify the digital healthcare experience
Ecommerce
Optimize for transactions

Use Case

Acquisition
Get users hooked from day one
Retention
Understand your customers like no one else
Monetization
Turn behavior into business

Team

Product
Fuel faster growth
Data
Make trusted data accessible
Engineering
Ship faster, learn more
Marketing
Build customers for life
Executive
Power decisions, shape the future

Size

Startups
Free analytics tools for startups
Enterprise
Advanced analytics for scaling businesses
Resources

Learn

Blog
Thought leadership from industry experts
Resource Library
Expertise to guide your growth
Compare
See how we stack up against the competition
Glossary
Learn about analytics, product, and technical terms
Explore Hub
Detailed guides on product and web analytics

Connect

Community
Connect with peers in product analytics
Events
Register for live or virtual events
Customers
Discover why customers love Amplitude
Partners
Accelerate business value through our ecosystem

Support & Services

Customer Help Center
All support resources in one place: policies, customer portal, and request forms
Developer Hub
Integrate and instrument Amplitude
Academy & Training
Become an Amplitude pro
Professional Services
Drive business success with expert guidance and support
Product Updates
See what's new from Amplitude

Tools

Benchmarks
Understand how your product compares
Templates
Kickstart your analysis with custom dashboard templates
Tracking Guides
Learn how to track events and metrics with Amplitude
Maturity Model
Learn more about our digital experience maturity model
Pricing
LoginContact salesGet started

AI

AI AgentsAI VisibilityAI FeedbackAmplitude MCP

Insights

Product AnalyticsMarketing AnalyticsSession ReplayHeatmaps

Action

Guides and SurveysFeature ExperimentationWeb ExperimentationFeature ManagementActivation

Data

Warehouse-native AmplitudeData GovernanceSecurity & PrivacyIntegrations
Amplitude Solutions →

Industry

Financial ServicesB2BMediaHealthcareEcommerce

Use Case

AcquisitionRetentionMonetization

Team

ProductDataEngineeringMarketingExecutive

Size

StartupsEnterprise

Learn

BlogResource LibraryCompareGlossaryExplore Hub

Connect

CommunityEventsCustomersPartners

Support & Services

Customer Help CenterDeveloper HubAcademy & TrainingProfessional ServicesProduct Updates

Tools

BenchmarksTemplatesTracking GuidesMaturity Model
LoginSign Up

Webinar Recap: Elevating the Experimentation Maturity Model

A candid conversation with Ronny Kohavi on why most product ideas fail—and why that’s the best reason to test everything
Insights

Aug 12, 2025

8 min read

Phil Burch

Phil Burch

Former Group Product Marketing Manager, Amplitude

Webinar Recap of a talk from Ronny Kohavi and Jheel Jawharkar

When product intuition and bias collide, experimentation offers a way forward. In Amplitude’s recent webinar, Ronny Kohavi, Ph.D.—renowned experimentation expert and former Executive at Microsoft, Amazon, and Airbnb—joined Jheel Jawharkar, Principal Product Manager at Amplitude, to share decades of lessons from building experimentation platforms and cultures at scale.

What followed was part data science masterclass, part cultural intervention, and all-around a call to action: if you want innovation, you need to test—early, often, and across your entire organization. Read on for a summary of their discussion!

Check out the full recording of the Ronny and Jheel’s webinar, Unleashing Innovation: Elevating the Experimentation Maturity Model.

The experimentation paradox: Most ideas fail

Ronny opens with a hard truth: most product ideas don’t work.

Industry-wide data shows that the vast majority of experiments fail to deliver meaningful gains on core business metrics. At Bing, only 15% of launched experiments succeeded. Even top performers like Google Ads, Netflix, and Booking.com report 10% success rates on average.

So why test?

Because the alternative is building blindly—shipping unvalidated ideas based on opinion or instinct, often backed by the “HiPPO” (Highest Paid Person’s Opinion). And if those don’t succeed, what do you learn?

Experimentation helps you find what works, fail safely, and learn fast. In short, low success rates aren’t a bug—they’re the reason you need experimentation in the first place.

From HiPPO to data-driven: How culture drives outcomes

Throughout the talk, Ronny emphasized that tools alone don’t build great experimentation programs—culture does. And shifting that culture requires challenging some deeply held beliefs:

  • “We know what to do. It’s in our DNA.”
  • “We don’t need to test something we’re going to ship anyway.”
  • “We’ve always done it this way.”

Ronny called this the “Semmelweis Reflex,” referencing the 19th-century doctor who discovered that handwashing dramatically reduced deaths in hospitals—only to be rejected by the medical establishment for decades. Innovation often meets resistance, especially when it contradicts existing power structures or expertise.

The same holds true in product development. Executives and project managers may bristle at experiments that contradict their assumptions. The answer, Ronny says, is to normalize experimentation as a learning tool—not a judgment of someone's ability.

Four stages of experimentation maturity

To help organizations map their path forward, Ronny shared a model for experimentation maturity:

1. Hubris

Teams are frequently overconfident in their instincts. Too often, decisions are based on opinions, not evidence. A data-driven culture shift is crucial for overcoming this stage.

2. Measurement and control

Organizations begin to track metrics and adopt safe deployment practices. They may use feature flags and A/B tests—but usually only after shipping.

3. Accept results (“Semmelweis Reflex”)

Experiments challenge long-held beliefs, and sometimes teams resist unexpected or uncomfortable results. When the “But that’s how we’ve always done it” argument comes in, it’s crucial to trust in the data.

4. Fundamental understanding

Experimentation becomes embedded in every decision. Teams test small ideas regularly, iterate based on results, and embrace failures as opportunities to learn.

Lessons from experience

Ronny’s team scaled Bing from a handful of experiments to more than 1,000 per month. This didn’t happen overnight. Key accelerators included:

  • Top-down support: When Satya Nadella became CEO, experimentation got a mandate.
  • Safe deployment: Feature flags allowed teams to ship with control and gradually introduce experiments.
  • Cultural storytelling: Sharing wins (and surprising failures) helped shift mindsets company-wide.
  • Automation: The cost of running an experiment dropped to near-zero thanks to robust infrastructure.

Other brands saw similar results. When Microsoft Office adopted the same platform, experimentation grew 700% year over year. But Ronny warned that you can’t go from zero experiments to thousands overnight. The groundwork—metrics, culture, tooling—must come first.

Helping the HiPPO

Without data, the HiPPO usually takes precedence. But with it, leaders are far more likely to de-prioritize their personal opinion in lieu of the cold, hard facts.

Ronny introduced a hierarchy of evidence to deliver to the highest-paid person:

  1. Anecdotes and opinions: lowest trust (e.g., “My daughter liked this feature.”)
  2. Observational data: useful, but prone to bias.
  3. Non-randomized tests: better, but causality isn’t clear.
  4. Randomized controlled experiments: the gold standard.
  5. Multiple randomized controlled experiments + meta-analysis: highest confidence.
hierarchy of evidence for experiments

Online, randomized controlled experiments take the form of A/B tests. They allow teams to quantify causal impact with high confidence—and avoid being misled by intuition or correlation.

Building the right metrics and mindset

A successful experimentation culture also requires well-defined metrics. At Bing, Ronny’s team used an Overall Evaluation Criterion (OEC)—a composite score combining revenue, relevance, and user satisfaction.

The goal? Shift accountability from output (“Did we ship it?”) to outcomes (“Did it work?”). That’s when teams start optimizing for impact instead of activity.

Common questions, real answers

During the Q&A, Jheel asked Ronny to address some of the attendees’ questions around experimentation:

My org only ships releases every six months. Can we even experiment?

Yes. Ronny has seen organizations shift their release-cycle culture by solving for real pain points in a controlled environment. Safe deployment, or feature flags, can solve issues on a smaller scale and can lead to wider buy-in for a more agile culture.

Who should own experimentation?

A Center of Excellence model often works best. Centralize best practices, tooling, and training—then enable individual teams to run their own tests.

How do I convince leadership?

Share success rates from industry leaders and emphasize the cost of false confidence. Most code doesn’t help users—experimentation prevents bloat and regression.

How do you test new features without hurting the user experience?

Make small, gradual changes instead of big redesigns. Start testing with your most active users, who will give you the clearest feedback.

Experimentation culture is about learning from mistakes

Ronny closed with an example that sums up experimentation, leadership support, and a data-driven culture. A young executive at IBM attempted a risky venture that ultimately cost the company $10MM. When the executive tried to hand in his resignation, Tom Watson Sr., the founder of IBM, responded, “You can’t be serious! We’ve just spent $10MM educating you.”

For Ronny Kohavi, successful experimentation is tied to metrics, proved by A/B tests, and fueled by a data-driven culture—even if your experiment fails.


Want to learn more?

Ronny and Jheel’s conversation is a goldmine of experimentation insights. Watch the full on-demand webinar to learn more.

Ronny also offers two courses on Maven:

  • Accelerating Innovation with A/B Testing: A two-week deep dive into experimentation strategy
  • Advanced Topics in Practical A/B Testing: For teams already running tests but looking to level up


About the author
Phil Burch

Phil Burch

Former Group Product Marketing Manager, Amplitude

More from Phil

Phil Burch is a former Group Product Marketing Manager for Amplitude Experiment. Phil previously held roles across the customer lifecycle including account management, solutions consulting, and product onboarding before moving into product marketing roles at Sysomos, Hearsay Systems, and Tray.io.

More from Phil
Topics

Experimentation

Recap

Metrics

Recommended Reading

article card image
Read 
Insights
Vibe Check Part 3: When Vibe Marketing Goes Off the Rails

Dec 4, 2025

8 min read

article card image
Read 
Customers
How CAFU Tripled Engagement and Boosted Conversions 20%+

Dec 4, 2025

8 min read

article card image
Read 
Customers
The Future is Data-Driven: Introducing the Winners of the Ampy Awards 2025

Dec 2, 2025

6 min read

article card image
Read 
Insights
Marketing Analytics in 2026: Predictions from the People Who Measure Everything

Nov 25, 2025

9 min read

Platform
  • Product Analytics
  • Feature Experimentation
  • Feature Management
  • Web Analytics
  • Web Experimentation
  • Session Replay
  • Activation
  • Guides and Surveys
  • AI Agents
  • AI Visibility
  • AI Feedback
  • Amplitude MCP
Compare us
  • Adobe
  • Google Analytics
  • Mixpanel
  • Heap
  • Optimizely
  • Fullstory
  • Pendo
Resources
  • Resource Library
  • Blog
  • Product Updates
  • Amp Champs
  • Amplitude Academy
  • Events
  • Glossary
Partners & Support
  • Contact Us
  • Customer Help Center
  • Community
  • Developer Docs
  • Find a Partner
  • Become an affiliate
Company
  • About Us
  • Careers
  • Press & News
  • Investor Relations
  • Diversity, Equity & Inclusion
Terms of ServicePrivacy NoticeAcceptable Use PolicyLegal
EnglishJapanese (日本語)Korean (한국어)Español (Spain)Português (Brasil)Português (Portugal)FrançaisDeutsch
© 2025 Amplitude, Inc. All rights reserved. Amplitude is a registered trademark of Amplitude, Inc.
Blog
InsightsProductCompanyCustomers
Topics

101

AI

APJ

Acquisition

Adobe Analytics

Amplify

Amplitude Academy

Amplitude Activation

Amplitude Analytics

Amplitude Audiences

Amplitude Community

Amplitude Feature Experimentation

Amplitude Guides and Surveys

Amplitude Heatmaps

Amplitude Made Easy

Amplitude Session Replay

Amplitude Web Experimentation

Amplitude on Amplitude

Analytics

B2B SaaS

Behavioral Analytics

Benchmarks

Churn Analysis

Cohort Analysis

Collaboration

Consolidation

Conversion

Customer Experience

Customer Lifetime Value

DEI

Data

Data Governance

Data Management

Data Tables

Digital Experience Maturity

Digital Native

Digital Transformer

EMEA

Ecommerce

Employee Resource Group

Engagement

Event Tracking

Experimentation

Feature Adoption

Financial Services

Funnel Analysis

Getting Started

Google Analytics

Growth

Healthcare

How I Amplitude

Implementation

Integration

LATAM

Life at Amplitude

MCP

Machine Learning

Marketing Analytics

Media and Entertainment

Metrics

Modern Data Series

Monetization

Next Gen Builders

North Star Metric

Partnerships

Personalization

Pioneer Awards

Privacy

Product 50

Product Analytics

Product Design

Product Management

Product Releases

Product Strategy

Product-Led Growth

Recap

Retention

Startup

Tech Stack

The Ampys

Warehouse-native Amplitude