Free A/B Test Significance Calculator

Check if your A/B test results are real or random noise. Enter your data and get a clear answer in seconds.


Version A (Control) — conversion rate: 4.00%

Version B (Variation) — conversion rate: 5.50%

Results

Not enough evidence yet. Keep the test running or test a bigger change.

Uplift: +37.5% (+1.50 percentage points, 4.00% → 5.50%)

Chance the result is random: 11.5%

There's an 11.5% chance this is random noise. You can't trust this result yet.

Chance B is truly better: 94.3%

Based on the data so far, this is the estimated likelihood that Version B genuinely outperforms A.

You need roughly 4,314 more total visitors to reach significance at this effect size.

Want to plan ahead? Calculate your sample size before the next test.


Required certainty: 95%
z-score: 1.5769


You ran a test. One version got more clicks. But is the difference real, or just random noise?

This calculator tells you. Enter your visitors and conversions for both versions, and you'll see whether the difference is statistically significant, plus how likely it is that Version B is actually better. No statistics degree required.

How to use this calculator

  1. Enter the number of visitors and conversions for Version A (your original page).
  2. Enter the same for Version B (your changed page).
  3. Pick your confidence level. 95% is the standard. Use 99% if you want to be extra sure.
  4. Read the verdict. Green means Version B is the real winner. Gray means you need more data.
  5. Check the Bayesian probability. It tells you the chance that Version B is genuinely better, as a simple percentage.

How we calculate this

This calculator uses two methods to analyze your results. You get both because they answer slightly different questions.

The frequentist method (p-value) asks: “If there were no real difference between the two versions, how likely is it that I’d see a gap this big by pure chance?” That probability is the p-value.

Here’s the intuition. Imagine you flip a coin 10 times and get 7 heads. Is the coin rigged? Probably not; you’d need more flips to know. But if you flip it 1,000 times and get 700 heads, something’s going on. The p-value captures that logic. It measures whether your sample is large enough and the difference big enough to rule out luck.
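If you want to see that intuition in numbers, here's a quick sketch using SciPy's exact binomial test (an illustration, not this calculator's actual code):

```python
# Same 70% heads rate, very different verdicts at different sample sizes.
from scipy.stats import binomtest

for flips, heads in [(10, 7), (1000, 700)]:
    result = binomtest(heads, flips, p=0.5)  # null hypothesis: the coin is fair
    print(f"{heads}/{flips} heads -> p-value = {result.pvalue:.3g}")

# 7/10 heads     -> p ≈ 0.34 (easily explained by luck)
# 700/1000 heads -> p is vanishingly small (luck is effectively ruled out)
```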

The math uses a two-proportion z-test. It compares the conversion rates of both versions, accounts for the sample size, and calculates how many standard deviations apart the results are. If that distance is large enough (based on your chosen confidence level), the result is significant.

The formula: z = (p₁ - p₂) / √(p̂(1-p̂)(1/n₁ + 1/n₂))

Where p₁ and p₂ are the conversion rates, n₁ and n₂ are the visitor counts, and p̂ is the pooled conversion rate. The z-score maps to a p-value using the normal distribution.

If p < 0.05 at 95% confidence, you can reject the null hypothesis and call the result significant.
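Here's a minimal Python sketch of that test, for anyone who wants to check the numbers. It's a standalone illustration, not the calculator's actual source, and the visitor counts below are assumed (1,000 per version happens to reproduce the example readout above):

```python
from math import sqrt
from scipy.stats import norm

def two_proportion_z_test(conv_a: int, n_a: int, conv_b: int, n_b: int):
    """Two-proportion z-test: returns (z, two-tailed p-value)."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    p_pooled = (conv_a + conv_b) / (n_a + n_b)        # p̂, the pooled rate
    se = sqrt(p_pooled * (1 - p_pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    p_value = 2 * (1 - norm.cdf(abs(z)))              # two-tailed
    return z, p_value

# Assumed inputs: 40/1,000 (4.00%) vs 55/1,000 (5.50%)
z, p = two_proportion_z_test(40, 1000, 55, 1000)
print(f"z = {z:.4f}, p = {p:.4f}")  # z ≈ 1.5769, p ≈ 0.115 -> not significant at 95%
```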

The Bayesian method asks a friendlier question: “What’s the probability that Version B is actually better?” Instead of “is this statistically significant at the 0.05 level,” you get “there’s an 87% chance B wins.” Most people find this easier to act on.

We calculate this using a normal approximation to the posterior distribution. It won’t always agree with the p-value, and that’s fine. The Bayesian probability is especially useful early in a test when you don’t have enough data for frequentist significance but want a sense of direction.
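For the curious, here's one common way to implement a normal approximation like this: treat each rate's posterior as roughly Normal(p, p(1−p)/n) and measure how much of the difference distribution sits above zero. The calculator's exact prior and approximation may differ:

```python
from math import sqrt
from scipy.stats import norm

def prob_b_beats_a(conv_a: int, n_a: int, conv_b: int, n_b: int) -> float:
    """P(B > A) under independent normal approximations to each posterior."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    var_a = p_a * (1 - p_a) / n_a
    var_b = p_b * (1 - p_b) / n_b
    return norm.cdf((p_b - p_a) / sqrt(var_a + var_b))

# Same assumed inputs as before: 40/1,000 vs 55/1,000
print(f"{prob_b_beats_a(40, 1000, 55, 1000):.1%}")  # ≈ 94.3%, matching the readout above
```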

Which one should you trust? Both. The p-value is the rigorous, industry-standard answer. The Bayesian probability is the practical, “should I care about this yet?” answer. If both agree, you’re in good shape. If the Bayesian probability is high but the p-value isn’t significant yet, you probably need more data. Keep the test running.

FAQ

What does statistical significance mean?

It means the difference between your two versions is unlikely to be caused by random chance. At 95% confidence, there’s only a 5% chance you’re seeing a pattern that isn’t real. It does not mean the result is big or important. A tiny difference (4.01% vs 4.00%) can be statistically significant with enough visitors. Always look at the actual size of the difference alongside the significance. Our guide to null hypothesis testing explains this in more detail.

What confidence level should I use?

95% is the standard and works for most tests. Use 90% if you’re running a quick directional test and can tolerate more risk. Use 99% if the change is expensive or hard to reverse (like a full site redesign). Higher confidence means you need more visitors. The tradeoff is always between certainty and speed. The sample size calculator shows exactly how visitor requirements change at each level.
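As a rough sketch of that tradeoff, here's the standard two-proportion sample-size formula applied to the 4.00% → 5.50% example above. The 80% power assumption is mine; the linked calculator may use slightly different conventions:

```python
from scipy.stats import norm

def visitors_per_version(p1: float, p2: float, confidence: float,
                         power: float = 0.80) -> float:
    z_alpha = norm.ppf(1 - (1 - confidence) / 2)   # two-sided significance threshold
    z_beta = norm.ppf(power)                       # power requirement
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return (z_alpha + z_beta) ** 2 * variance / (p2 - p1) ** 2

for conf in (0.90, 0.95, 0.99):
    n = visitors_per_version(0.04, 0.055, conf)
    print(f"{conf:.0%} confidence: ~{n:,.0f} visitors per version")
# 90% ≈ 2,480 · 95% ≈ 3,150 · 99% ≈ 4,690 (per version, roughly)
```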

My test is significant at 90% but not 95%. What do I do?

You have three options. First, keep the test running until you hit 95% (safest). Second, accept the 90% result if the change is low-risk and easy to reverse (reasonable for minor copy changes). Third, check the Bayesian probability. If it shows an 85%+ chance B is better, that’s additional evidence in the same direction. The choice depends on what’s at stake. A button color change? 90% might be fine. A complete page redesign? Wait for 95%.

Can a test be significant but not meaningful?

Yes. Statistical significance tells you the difference is real (not random). It doesn’t tell you the difference is big enough to matter. A 0.1 percentage point improvement might be statistically significant with 500,000 visitors, but the business impact could be negligible. Always check the absolute uplift alongside the significance. If the difference is real but tiny, your time is better spent testing something with a bigger potential payoff.
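You can sanity-check that claim with a quick back-of-the-envelope z-test (the exact numbers here are assumed for illustration):

```python
from math import sqrt
from scipy.stats import norm

n = 500_000                               # visitors per version
p_a, p_b = 0.040, 0.041                   # 4.0% vs 4.1%: a 0.1-point lift
p_pooled = (p_a + p_b) / 2                # equal sample sizes, so a simple average
z = (p_b - p_a) / sqrt(p_pooled * (1 - p_pooled) * (2 / n))
p_value = 2 * (1 - norm.cdf(z))
print(f"z = {z:.2f}, p = {p_value:.4f}")  # z ≈ 2.54, p ≈ 0.011 -> significant, yet tiny
```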

What’s the difference between frequentist and Bayesian results?

Frequentist statistics (the p-value) answers: “How likely is this data if there’s no real difference?” Bayesian statistics answers: “Given this data, how likely is it that B is better?” The Bayesian approach is more intuitive and works better with smaller samples, which is why Kirro uses Bayesian statistics in the product. This calculator shows both so you can compare.

How many visitors do I need before checking significance?

Calculate your required sample size before you start the test using the sample size calculator. Checking too early and making decisions based on incomplete data is called peeking, and it’s one of the most common A/B testing mistakes. If you’ve already started without calculating, a rough guideline: you need at least 100 conversions per version before significance results become reliable.

Got a winner? Push it live. Got noise? Kirro helps you figure out what to test next. Try it free.

Try Kirro

Run smarter A/B tests and boost your conversions

Everything. No limits. No surprises.

Get started free