Key statistical definitions for AB Testing

Hypothesis testing

This is a statistical technique to detect whether there is no difference between two samples of data. In an AB Test, we are interested in whether our variation is better than the control. In other words, will the conversion rate be better for the variation than the control. The most difficult concept to grasp here is that a hypothesis only detects a lack of difference, rather than whether there is a difference.

Significance

This is basically the threshold at which we would consider there to be a significant difference, and is typically set at 0.1, 0.05 or 0.01. This significance level determines how much weight is given to the extreme instances of a test. If you choose a smaller significance level, there would be a much smaller margin for your test to be significant with an extreme result.

Confidence

Confidence is more commonly associated with confidence intervals and is isolated to your test, however it is directly related to significance. If you want to be 90% confident, then you would set your significance at a 0.1 level. Intuitively, this makes sense since if you want to be more confident that you have a significant test, then you want a smaller margin for extreme results. So, the confidence and significance level scale appropriately.

P-values

As we said earlier (and even used before), the significance level is a threshold and it’s the p-value that is the measuring stick here. Statistically, the p-value is the probability that the test is part of the null distribution. In normal AB Testing speak, this is the probability that there is not a difference between the variant and the control, and that the difference you’ve found is completely by chance.

Each one of these plays a part in a basic AB Test, from constructing your hypothesis, to conducting your test and analysing the results. It’s important to understand that with a hypothesis test, the test will never tell you whether there is a difference, only evidence to suggest there is no difference. And even then, you control whether you determine a test is a significant test or not.

Edit plot a course with Burgess Yachts

May 24, 2021

Edit has been appointed by Burgess Yachts to support the implementation of a major marketing technology programme for the business, concentrated on enhancing the usage of customer data. With a heritage spanning close to 50 years, Burgess Yachts was founded in 1975 by...

Customer Insights & the wider Microsoft tech stack

Mar 29, 2021

Customer Data Platforms (CDP) are growing faster than almost any other marketing technology. In simple terms, a CDP is a master database of your customers and all related information – detailing everything from their contact details to their interactions with your...

Edit under the hood: Andy Aldersley and Sean Longthorpe

Mar 12, 2021

Meet a couple of the clever people behind our intelligent data pillar – our team at the helm of turning data into profit. Senior Data Scientist Andy Aldersley and Senior Insights Analyst Sean Longthorpe discussed the work their team does, the value intelligent data...

How AI spotted coronavirus before it went viral

Mar 5, 2020

With the world gripped by the Coronavirus (COVID-19) epidemic, it’s clear that the main ways to mitigate the impact are through personal hygiene (hand washing), increasing social distance (keeping away from people) and imposing quarantine for effected areas and...

Connected Digital Journeys in Pharma

Feb 12, 2020

In pharma, ‘digital’ is no longer merely a channel for tactical execution. It is at the core of both commercial and business planning. Clinicians, healthcare professionals and administrators are engaging with both ‘digital’ and ‘tech’ as part of their day-to-day. Not...

The Way We Watch: 2020’s Streaming Landscape

Feb 7, 2020

The broadcast streaming landscape is a dynamic environment. How has the explosion of rival streaming services affected the TV audience landscape? Here's what you need to know. The Impact on audiences – based on Ofcom’s Media Nations Report 2019 Across all age groups,...

Secure to not secure: What you need to know about Google’s TLS announcement

Jan 16, 2020

Secure to not secure: What you need to know about Google’s TLS announcement Google is cracking down on sites using "non-secure" versions of TLS (Transport Layer Security). To keep your site secure and your users feeling safe, it’s important you make sure you update...

7 pitfalls when using Google Data Studio

Jul 9, 2018

Ok the title is click baity - Google Data Studio is an awesome tool and many of you are probably already using it on some level. For those of you that don’t know Google Data Studio, it’s time to get on board. After all, it’s free and allows you to simplify the...

Keyword Rankings and Forecasting in SEO using Shiny

Jun 28, 2018

Previously, we’ve seen how R can be used to retrieve data from APIs such as Google Analytics. Often with data, you’ll conduct the same type of analysis repeatedly, using the same kind of code, for various projects. To make life easier, wouldn’t you want to build a...

Using GTM to avoid (not set) in Google Analytics

Jun 26, 2018

If you use a Google Tag Manager variable whose type is “number” to send data to Google Analytics, you may notice that some of your data comes through as (not set), rather than the value you wanted. The example above shows an event label which is pulled in from the...

Hypothesis testing

Significance

Confidence

P-values

Summary:

Related Posts

Discover

Explore

Social

Subscribe to our newsletter