Question 1

What is statistical significance in A/B testing?

Accepted Answer

Statistical significance in A/B testing means that the difference in conversion rates between your control and variant is unlikely to be due to random chance. When a result is statistically significant (typically at p < 0.05 or 95% confidence), you can be reasonably confident that the observed difference reflects a real effect and not just noise in the data.

Question 2

What confidence level should I use for A/B tests?

Accepted Answer

95% confidence (p < 0.05) is the standard for most A/B tests and means there is only a 5% chance of a false positive. For low-risk tests — like copy or color changes — 90% confidence may be acceptable. For high-stakes changes like checkout redesigns or pricing, consider using 99% confidence to minimize risk.

Question 3

How many visitors do I need for an A/B test?

Accepted Answer

The required sample size depends on your baseline conversion rate, the minimum detectable effect you care about, and the statistical power you want (typically 80%). As a rule of thumb, detecting a 10% relative improvement on a 3% conversion rate requires roughly 10,000–15,000 visitors per variant. Use the minimum sample size estimate shown by this calculator for your specific inputs.

Question 4

What is a p-value?

Accepted Answer

The p-value is the probability of observing a difference as large as (or larger than) the one measured, assuming there is actually no difference between the variants. A p-value of 0.05 means there is a 5% chance the result is due to random chance. Lower p-values indicate stronger evidence that the difference is real.

Question 5

What is the difference between statistical and practical significance?

Accepted Answer

Statistical significance tells you whether a difference is real and unlikely due to chance. Practical significance (also called effect size or business significance) tells you whether the difference is large enough to matter for your business. A test can be statistically significant but have such a tiny uplift (e.g., 0.1%) that it is not worth acting on. Always evaluate both the p-value and the relative uplift together.

A/B Test Calculator

Frequently Asked Questions

Frequently Asked Questions

Related tools