SlideShare una empresa de Scribd logo
1 de 40
The Art of A/B Testing
@mike_greenfield
numeratechoir.com
500 Startups, 2013-05-08
What’s an A/B Test?
Definition: An A/B Test is a means by which
a product’s users are randomly given
one of two or more experiences.
Usage: Companies use A/B Tests to
discover which experience is most
effective.
A/B Testing Overview
Why You Should A/B Test
What You Should A/B Test
When You Should A/B Test
How You Should A/B Test
What I’ve Learned from A/B Testing
Why You Should A/B Test
Humans’ quantitative intuition is poor
Beliefs:
1) “I have great product intuition”
2) “This is business, not science”
3) “It leads to a local maximum”
Reality:
At Internet scale, testing and measuring
properly has a huge return on investment.
“I have great product intuition”
• Improved signup flow: often a disaster
• Sharing on Facebook: can be good or
bad
• Home page changes: usually a mixed
bag
…but product changes != progress
“This is business, not science”
• Requires oversight and sleuthing
• Only works for big improvements
• Requires other factors to stay the same
…but change-then-measure is flawed
Test Test Type Goal
Change Button
Color
Optimization Increased clickthrough
Old Site Design
vs. New Site
Design
Holistic (aka don’t
shoot yourself in
the foot)
Make an informed
decision on old vs. new
“It leads to a local maximum”
…so test holistically
What You Should A/B Test
Test an almost viral flow
1000
signups
300
inviting
friends
3000
invitations
1500
invitation
clicks
900
signups
K-factor = 0.9
☑ A/B Test this flow!
Don’t test a non-viral flow
1000
signups
100
inviting
friends
600
invitations
200
invitation
clicks
80
signups
K-factor = 0.08
Spend time elsewhere
Test email
subject lines
Subject: The Craziest Thing
My Child Has Done
Subject: 5 Embarrassing Kid
Moments
vs.
Test email
subject lines
Subject: The Craziest Thing
My Child Has Done
Subject: 5 Embarrassing Kid
Moments
vs.
+153%
Improve purchase funnel
Get 2011 Picks
Get Bracketbrains
vs.
Improve purchase funnel
Get 2011 Picks
Get Bracketbrains
vs.
+30%
Validate your new design*
* if you have scale
Scale What to Do Reasoning
Early stage Just change it. Nothing to lose, no data to
test.
Something to lose A/B test it. The existing product is
probably more effective
than you think.
Understand the
consequences of an
“upgrade.”
When You Should A/B Test
Test only if you can get to
statistical significance
(Google: “split test calculator”)
Big: 200,000
emails with a 3%
CTR option and a
4% CTR option
-------------------------
6000 vs. 8000
Small: 200 emails
with a 3% CTR
option and a 4%
CTR option
--------------------------
6 vs. 8
Test only if you can get to business
significance
• Only things that can cumulatively
have a meaningful impact on your
business
• For emails, a small list means small
improvements: don’t test
• For virality, small changes matter if and
only if you’re close to K=1
Rule of thumb: test every user-
facing change that will be seen
by 10,000-100,000 people
How You Should A/B Test
Okay Choice: Use Commercial
Tools
• MixPanel
• Unbounce
• Optimizely
• Google Analytics
Best Choice: Build Your Own
Framework
• Yep, it’s work with no immediate
payoff
• Your mom won’t care
• Your users won’t care
But…
• There are simple ways to get started
• It gives you tons of flexibility
Why Build Your Own
• Incorporate tests in many places
(page ordering, new designs, email
content, email strategy, mobile)
• Look at results holistically
• Go back and see how any test
influences anything, not just the stats
you’re tracking
Code
It needs to be super simple to create a test
{ab_test_if test=“signup_reason”
option=“awesome” user=$viewer}
because it’s awesome!
{/ab_test_if}
{ab_test_if test=“signup_reason” option=“free”
user=$viewer}
because it’s free!
{/ab_test_if}
Data Structure
AB_TESTS (id, name, time_created)
AB_TEST_OPTIONS
(id, ab_test_id, weight, name)
USER_AB_TEST_OPTIONS
(id, user_id/visitor_id, ab_test_option_id, t
ime_created)
Reporting
select
AB_TEST_OPTION_ID,
ACTIVITY_ID,
count(distinct USER_ID) USERS_DOING_ACTIVITY
from USER_ACTIVITY a, USER_AB_TEST_OPTIONS b
where a.USER_ID=b.USER_ID
and AB_TEST_OPTION_ID in (…)
and ACTIVITY_ID in (…)
and a.time_created>b.time_created
GROUP BY AB_TEST_OPTION_ID, ACTIVITY_ID;
Scaling
• An A/B testing system can yield a lot of
DB writes
• Reporting means many long-running
SQL queries
• Need to batch several aspects
Run Some A/A Tests
• Test two versions of the same thing
• If results are wildly off, something’s
wrong with the testing system
• Deciding too early is a major issue: it’s
usually best to be conservative before
choosing a winner
More framework implementation
details: bit.ly/artofabtesting
What I’ve learned from A/B testing
Focus on 1 item in emails
• Clear subject focused on that item (Why
the Giants will win the World Series)
• Body of text focused on that item
(peripheral content is okay on the
periphery)
• Clear, big clickthrough action in the
email body (See why the Giants will win)
• Require clickthrough to get the full story
In signup, minimize distraction
• Provide context/messaging of what
the product is, but don’t make it
clickable
• Clear “next” or “continue” steps to
guide user through the process
• Remove unnecessary navigation
Highlight friends, not your product
• Most effective: your friends are doing
something; you should join them
• Unless you’re Apple, no one cares
about your new feature or new design
• People probably don’t care about
your fancy new social network
• “Join my circle because I trust you”
beats “check out this great product”
When possible,
TELL USERS WHAT TO DO
3 Things to Remember
A/B testing = good culture
• Data trump opinions
• Iterate quickly but intelligently
• Everyone gets better at predicting
product success
Test changes if they’re likely to
have both statistical and business
significance
Validate the Big Stuff
A/B Test Holistically; testing
isn’t a substitute for product
vision.
Optimize the Small Stuff
The details matter more than you
think.
Thanks.
mike@mikegreenfield.com
@mike_greenfield

Más contenido relacionado

Destacado

10 Mobile Marketing Campaigns That Went Viral and Made Millions
10 Mobile Marketing Campaigns That Went Viral and Made Millions10 Mobile Marketing Campaigns That Went Viral and Made Millions
10 Mobile Marketing Campaigns That Went Viral and Made MillionsMark Fidelman
 
The Science of Marketing Automation
The Science of Marketing AutomationThe Science of Marketing Automation
The Science of Marketing AutomationHubSpot
 
Stop Leaving Money on the Table! Optimizing your Site for Users and Revenue
Stop Leaving Money on the Table! Optimizing your Site for Users and RevenueStop Leaving Money on the Table! Optimizing your Site for Users and Revenue
Stop Leaving Money on the Table! Optimizing your Site for Users and RevenueJosh Patrice
 
10 Ways You're Using AdWords Wrong and How to Correct Those Practices
10 Ways You're Using AdWords Wrong and How to Correct Those Practices 10 Ways You're Using AdWords Wrong and How to Correct Those Practices
10 Ways You're Using AdWords Wrong and How to Correct Those Practices Kissmetrics on SlideShare
 
The Beginners Guide to Startup PR #startuppr
The Beginners Guide to Startup PR #startupprThe Beginners Guide to Startup PR #startuppr
The Beginners Guide to Startup PR #startupprOnboardly
 
Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...
Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...
Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...GeekWire
 
Biz Dev 101 - An Interactive Workshop on How Deals Get Done
Biz Dev 101 - An Interactive Workshop on How Deals Get DoneBiz Dev 101 - An Interactive Workshop on How Deals Get Done
Biz Dev 101 - An Interactive Workshop on How Deals Get DoneScott Pollack
 
Google Analytics Fundamentals: Set Up and Basics for Measurement
Google Analytics Fundamentals: Set Up and Basics for MeasurementGoogle Analytics Fundamentals: Set Up and Basics for Measurement
Google Analytics Fundamentals: Set Up and Basics for MeasurementOrbit Media Studios
 
Mastering Google Adwords In 30 Minutes
Mastering Google Adwords In 30 MinutesMastering Google Adwords In 30 Minutes
Mastering Google Adwords In 30 MinutesNik Cree
 
No excuses user research
No excuses user researchNo excuses user research
No excuses user researchLily Dart
 
How Top Brands Use Referral Programs to Drive Customer Acquisition
How Top Brands Use Referral Programs to Drive Customer AcquisitionHow Top Brands Use Referral Programs to Drive Customer Acquisition
How Top Brands Use Referral Programs to Drive Customer AcquisitionKissmetrics on SlideShare
 
How to: Viral Marketing + Brand Storytelling
How to: Viral Marketing + Brand Storytelling How to: Viral Marketing + Brand Storytelling
How to: Viral Marketing + Brand Storytelling Elle Shelley
 
Wireframes - a brief overview
Wireframes - a brief overviewWireframes - a brief overview
Wireframes - a brief overviewJenni Leder
 
A Guide to User Research (for People Who Don't Like Talking to Other People)
A Guide to User Research (for People Who Don't Like Talking to Other People)A Guide to User Research (for People Who Don't Like Talking to Other People)
A Guide to User Research (for People Who Don't Like Talking to Other People)Stephanie Wills
 
Using Your Growth Model to Drive Smarter High Tempo Testing
Using Your Growth Model to Drive Smarter High Tempo TestingUsing Your Growth Model to Drive Smarter High Tempo Testing
Using Your Growth Model to Drive Smarter High Tempo TestingSean Ellis
 
SQL Tutorial for Marketers
SQL Tutorial for MarketersSQL Tutorial for Marketers
SQL Tutorial for MarketersJustin Mares
 
How to Plug a Leaky Sales Funnel With Facebook Retargeting
How to Plug a Leaky Sales Funnel With Facebook RetargetingHow to Plug a Leaky Sales Funnel With Facebook Retargeting
How to Plug a Leaky Sales Funnel With Facebook RetargetingDigital Marketer
 
Understand A/B Testing in 9 use cases & 7 mistakes
Understand A/B Testing in 9 use cases & 7 mistakesUnderstand A/B Testing in 9 use cases & 7 mistakes
Understand A/B Testing in 9 use cases & 7 mistakesTheFamily
 
User experience doesn't happen on a screen: It happens in the mind.
User experience doesn't happen on a screen: It happens in the mind.User experience doesn't happen on a screen: It happens in the mind.
User experience doesn't happen on a screen: It happens in the mind.John Whalen
 

Destacado (20)

10 Mobile Marketing Campaigns That Went Viral and Made Millions
10 Mobile Marketing Campaigns That Went Viral and Made Millions10 Mobile Marketing Campaigns That Went Viral and Made Millions
10 Mobile Marketing Campaigns That Went Viral and Made Millions
 
The Science of Marketing Automation
The Science of Marketing AutomationThe Science of Marketing Automation
The Science of Marketing Automation
 
Stop Leaving Money on the Table! Optimizing your Site for Users and Revenue
Stop Leaving Money on the Table! Optimizing your Site for Users and RevenueStop Leaving Money on the Table! Optimizing your Site for Users and Revenue
Stop Leaving Money on the Table! Optimizing your Site for Users and Revenue
 
10 Ways You're Using AdWords Wrong and How to Correct Those Practices
10 Ways You're Using AdWords Wrong and How to Correct Those Practices 10 Ways You're Using AdWords Wrong and How to Correct Those Practices
10 Ways You're Using AdWords Wrong and How to Correct Those Practices
 
The Beginners Guide to Startup PR #startuppr
The Beginners Guide to Startup PR #startupprThe Beginners Guide to Startup PR #startuppr
The Beginners Guide to Startup PR #startuppr
 
Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...
Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...
Brenda Spoonemore - A biz dev playbook for startups: Why, when and how to do ...
 
Biz Dev 101 - An Interactive Workshop on How Deals Get Done
Biz Dev 101 - An Interactive Workshop on How Deals Get DoneBiz Dev 101 - An Interactive Workshop on How Deals Get Done
Biz Dev 101 - An Interactive Workshop on How Deals Get Done
 
Google Analytics Fundamentals: Set Up and Basics for Measurement
Google Analytics Fundamentals: Set Up and Basics for MeasurementGoogle Analytics Fundamentals: Set Up and Basics for Measurement
Google Analytics Fundamentals: Set Up and Basics for Measurement
 
Mastering Google Adwords In 30 Minutes
Mastering Google Adwords In 30 MinutesMastering Google Adwords In 30 Minutes
Mastering Google Adwords In 30 Minutes
 
No excuses user research
No excuses user researchNo excuses user research
No excuses user research
 
HTML & CSS Masterclass
HTML & CSS MasterclassHTML & CSS Masterclass
HTML & CSS Masterclass
 
How Top Brands Use Referral Programs to Drive Customer Acquisition
How Top Brands Use Referral Programs to Drive Customer AcquisitionHow Top Brands Use Referral Programs to Drive Customer Acquisition
How Top Brands Use Referral Programs to Drive Customer Acquisition
 
How to: Viral Marketing + Brand Storytelling
How to: Viral Marketing + Brand Storytelling How to: Viral Marketing + Brand Storytelling
How to: Viral Marketing + Brand Storytelling
 
Wireframes - a brief overview
Wireframes - a brief overviewWireframes - a brief overview
Wireframes - a brief overview
 
A Guide to User Research (for People Who Don't Like Talking to Other People)
A Guide to User Research (for People Who Don't Like Talking to Other People)A Guide to User Research (for People Who Don't Like Talking to Other People)
A Guide to User Research (for People Who Don't Like Talking to Other People)
 
Using Your Growth Model to Drive Smarter High Tempo Testing
Using Your Growth Model to Drive Smarter High Tempo TestingUsing Your Growth Model to Drive Smarter High Tempo Testing
Using Your Growth Model to Drive Smarter High Tempo Testing
 
SQL Tutorial for Marketers
SQL Tutorial for MarketersSQL Tutorial for Marketers
SQL Tutorial for Marketers
 
How to Plug a Leaky Sales Funnel With Facebook Retargeting
How to Plug a Leaky Sales Funnel With Facebook RetargetingHow to Plug a Leaky Sales Funnel With Facebook Retargeting
How to Plug a Leaky Sales Funnel With Facebook Retargeting
 
Understand A/B Testing in 9 use cases & 7 mistakes
Understand A/B Testing in 9 use cases & 7 mistakesUnderstand A/B Testing in 9 use cases & 7 mistakes
Understand A/B Testing in 9 use cases & 7 mistakes
 
User experience doesn't happen on a screen: It happens in the mind.
User experience doesn't happen on a screen: It happens in the mind.User experience doesn't happen on a screen: It happens in the mind.
User experience doesn't happen on a screen: It happens in the mind.
 

Más de 500 Startups (20)

Get on Board
Get on BoardGet on Board
Get on Board
 
Connected Analytics
Connected AnalyticsConnected Analytics
Connected Analytics
 
Sira Medical
Sira MedicalSira Medical
Sira Medical
 
The Atlas
The AtlasThe Atlas
The Atlas
 
Trash Warrior
Trash WarriorTrash Warrior
Trash Warrior
 
Thematic
ThematicThematic
Thematic
 
Shiplyst
ShiplystShiplyst
Shiplyst
 
Renetec
RenetecRenetec
Renetec
 
Predina
PredinaPredina
Predina
 
Pluto
PlutoPluto
Pluto
 
Plant an App
Plant an AppPlant an App
Plant an App
 
Pilota
PilotaPilota
Pilota
 
Mero Technologies
Mero TechnologiesMero Technologies
Mero Technologies
 
Omnitron Sensors
Omnitron SensorsOmnitron Sensors
Omnitron Sensors
 
Juked
JukedJuked
Juked
 
GamerzClass
GamerzClassGamerzClass
GamerzClass
 
eino
einoeino
eino
 
Cenos
CenosCenos
Cenos
 
Bliinx
BliinxBliinx
Bliinx
 
Butlr
ButlrButlr
Butlr
 

Último

Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 

Último (20)

Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 

A/B Testing with Mike Greenfield

  • 1. The Art of A/B Testing @mike_greenfield numeratechoir.com 500 Startups, 2013-05-08
  • 2. What’s an A/B Test? Definition: An A/B Test is a means by which a product’s users are randomly given one of two or more experiences. Usage: Companies use A/B Tests to discover which experience is most effective.
  • 3. A/B Testing Overview Why You Should A/B Test What You Should A/B Test When You Should A/B Test How You Should A/B Test What I’ve Learned from A/B Testing
  • 4. Why You Should A/B Test
  • 5. Humans’ quantitative intuition is poor Beliefs: 1) “I have great product intuition” 2) “This is business, not science” 3) “It leads to a local maximum” Reality: At Internet scale, testing and measuring properly has a huge return on investment.
  • 6. “I have great product intuition” • Improved signup flow: often a disaster • Sharing on Facebook: can be good or bad • Home page changes: usually a mixed bag …but product changes != progress
  • 7. “This is business, not science” • Requires oversight and sleuthing • Only works for big improvements • Requires other factors to stay the same …but change-then-measure is flawed
  • 8. Test Test Type Goal Change Button Color Optimization Increased clickthrough Old Site Design vs. New Site Design Holistic (aka don’t shoot yourself in the foot) Make an informed decision on old vs. new “It leads to a local maximum” …so test holistically
  • 9. What You Should A/B Test
  • 10. Test an almost viral flow 1000 signups 300 inviting friends 3000 invitations 1500 invitation clicks 900 signups K-factor = 0.9 ☑ A/B Test this flow!
  • 11. Don’t test a non-viral flow 1000 signups 100 inviting friends 600 invitations 200 invitation clicks 80 signups K-factor = 0.08 Spend time elsewhere
  • 12. Test email subject lines Subject: The Craziest Thing My Child Has Done Subject: 5 Embarrassing Kid Moments vs.
  • 13. Test email subject lines Subject: The Craziest Thing My Child Has Done Subject: 5 Embarrassing Kid Moments vs. +153%
  • 14. Improve purchase funnel Get 2011 Picks Get Bracketbrains vs.
  • 15. Improve purchase funnel Get 2011 Picks Get Bracketbrains vs. +30%
  • 16. Validate your new design* * if you have scale Scale What to Do Reasoning Early stage Just change it. Nothing to lose, no data to test. Something to lose A/B test it. The existing product is probably more effective than you think. Understand the consequences of an “upgrade.”
  • 17. When You Should A/B Test
  • 18. Test only if you can get to statistical significance (Google: “split test calculator”) Big: 200,000 emails with a 3% CTR option and a 4% CTR option ------------------------- 6000 vs. 8000 Small: 200 emails with a 3% CTR option and a 4% CTR option -------------------------- 6 vs. 8
  • 19. Test only if you can get to business significance • Only things that can cumulatively have a meaningful impact on your business • For emails, a small list means small improvements: don’t test • For virality, small changes matter if and only if you’re close to K=1
  • 20. Rule of thumb: test every user- facing change that will be seen by 10,000-100,000 people
  • 21. How You Should A/B Test
  • 22. Okay Choice: Use Commercial Tools • MixPanel • Unbounce • Optimizely • Google Analytics
  • 23. Best Choice: Build Your Own Framework • Yep, it’s work with no immediate payoff • Your mom won’t care • Your users won’t care But… • There are simple ways to get started • It gives you tons of flexibility
  • 24. Why Build Your Own • Incorporate tests in many places (page ordering, new designs, email content, email strategy, mobile) • Look at results holistically • Go back and see how any test influences anything, not just the stats you’re tracking
  • 25. Code It needs to be super simple to create a test {ab_test_if test=“signup_reason” option=“awesome” user=$viewer} because it’s awesome! {/ab_test_if} {ab_test_if test=“signup_reason” option=“free” user=$viewer} because it’s free! {/ab_test_if}
  • 26. Data Structure AB_TESTS (id, name, time_created) AB_TEST_OPTIONS (id, ab_test_id, weight, name) USER_AB_TEST_OPTIONS (id, user_id/visitor_id, ab_test_option_id, t ime_created)
  • 27. Reporting select AB_TEST_OPTION_ID, ACTIVITY_ID, count(distinct USER_ID) USERS_DOING_ACTIVITY from USER_ACTIVITY a, USER_AB_TEST_OPTIONS b where a.USER_ID=b.USER_ID and AB_TEST_OPTION_ID in (…) and ACTIVITY_ID in (…) and a.time_created>b.time_created GROUP BY AB_TEST_OPTION_ID, ACTIVITY_ID;
  • 28. Scaling • An A/B testing system can yield a lot of DB writes • Reporting means many long-running SQL queries • Need to batch several aspects
  • 29. Run Some A/A Tests • Test two versions of the same thing • If results are wildly off, something’s wrong with the testing system • Deciding too early is a major issue: it’s usually best to be conservative before choosing a winner
  • 30. More framework implementation details: bit.ly/artofabtesting
  • 31. What I’ve learned from A/B testing
  • 32. Focus on 1 item in emails • Clear subject focused on that item (Why the Giants will win the World Series) • Body of text focused on that item (peripheral content is okay on the periphery) • Clear, big clickthrough action in the email body (See why the Giants will win) • Require clickthrough to get the full story
  • 33. In signup, minimize distraction • Provide context/messaging of what the product is, but don’t make it clickable • Clear “next” or “continue” steps to guide user through the process • Remove unnecessary navigation
  • 34. Highlight friends, not your product • Most effective: your friends are doing something; you should join them • Unless you’re Apple, no one cares about your new feature or new design • People probably don’t care about your fancy new social network • “Join my circle because I trust you” beats “check out this great product”
  • 36. 3 Things to Remember
  • 37. A/B testing = good culture • Data trump opinions • Iterate quickly but intelligently • Everyone gets better at predicting product success
  • 38. Test changes if they’re likely to have both statistical and business significance
  • 39. Validate the Big Stuff A/B Test Holistically; testing isn’t a substitute for product vision. Optimize the Small Stuff The details matter more than you think.