Testing Methodology · 12 Jun, 2026

Control group: what it is, why it matters, and how it works in any experiment

đź§«
🎯

What is a control group?

A control group is the unchanged version in an experiment. It’s the baseline you compare everything else against.

A control group is the group in an experiment that doesn’t receive any new treatment. It stays exactly as-is. The whole point is to give you something stable to compare your changes against.

Without that baseline, you can’t tell whether your change caused the result or something else did.

Think of it like a cooking experiment. You make two batches of cookies. One batch uses the original recipe (that’s your control group). The other swaps butter for coconut oil.

If the coconut oil batch tastes better, you know why. Without the original recipe as your baseline, you’re just guessing.

In split testing, the control group is your current page. Version A. The one that’s already live. You’re comparing it against a new version (Version B) to see which performs better.

You’ll also hear people call it the “comparison group” or “baseline group.” Same idea, different label.

How control groups work in experiments

You split your audience randomly into two groups, change one thing for one group, and compare the results.

Here’s the basic flow, whether you’re running a clinical trial or testing a new headline on your website:

  1. Start with your population. That’s everyone in the experiment. Website visitors, email subscribers, patients in a study.
  2. Split them randomly into two groups. Random is the key word. No cherry-picking. No putting your best customers in one group.
  3. Leave one group alone. That’s your control group. They see your current page, get no email, receive the placebo pill.
  4. Change one thing for the other group. That’s your treatment group (sometimes called the experimental group or the variant). They see the new headline, get the promotional email, take the new medication.
  5. Measure the same thing for both groups. Conversion rate, recovery time, purchase rate, whatever matters.
  6. Compare. If the treatment group does better than the control group, your change probably worked.

The random split is what makes this trustworthy. Without it, you’re just comparing apples to oranges. (Need help structuring this? Grab an A/B testing template and fill in the blanks.)

Both groups also need to run at the same time. Comparing this month’s results to last month’s isn’t a real control. Too many things change between months (seasons, holidays, a competitor launching a sale).

That’s why designing a proper marketing experiment always includes a simultaneous control.

A/B testing tools like Kirro handle the random split automatically. You pick what to change, the tool divides your visitors 50/50 and tracks the results. No spreadsheet required.

Our take: The hardest part of control groups isn’t the concept. It’s the patience. People want to skip ahead and ship the change. But the whole point is waiting for the data, and the data needs a baseline to mean anything.

Control groups in science vs. marketing

Same core principle, very different stakes. A scientist uses a control group to test a drug. A marketer uses one to test a checkout page.

The concept of a control group has been around for almost 300 years. In 1747, a ship’s doctor named James Lind tested six different scurvy treatments on 12 sailors aboard the HMS Salisbury.

He gave each pair a different remedy (citrus fruit, cider, vinegar, seawater, and two others), kept their conditions identical, and compared what happened. The citrus pair recovered in six days. Everyone else stayed sick.

That’s a control group in action. And the logic hasn’t changed since.

In 1935, statistician Ronald Fisher formalized the idea. He proved that random assignment eliminates bias and makes results trustworthy. His work at an agricultural research station (testing crop yields) became the foundation of every clinical trial and A/B test run today.

Here’s how the same principle plays out in two different worlds:

ScienceMarketing
What’s testedNew drug, therapy, medical deviceNew headline, page layout, email
Control receivesPlacebo pill or standard treatmentCurrent page or no campaign
Sample sizeHundreds to thousandsThousands to millions
DurationMonths to yearsDays to weeks
Ethics reviewRequired (you can’t give a placebo when a proven treatment exists)None required
Cost per testHigh (staff, regulatory, time)Near zero
What’s at stakeHuman healthRevenue

The FDA requires clinical trials to include a control group. Their official guidance (ICH E10) defines five types of controls and when each is appropriate.

Marketing has no such rules. Nobody will stop you from shipping a redesign without testing it. But that doesn’t mean you should skip the control.

At Booking.com, a team of 1,800+ runs over 25,000 A/B tests per year. Ninety percent fail. Meaning 9 out of 10 ideas don’t improve anything.

Without a control group for each test, they’d have shipped thousands of changes that either did nothing or made things worse.

Google runs 10,000+ experiments a year. Microsoft’s Bing team once ran a “low priority” A/B test on ad headline formatting. It led to a 12% revenue increase, translating to over $100 million annually.

The test sat ignored for six months before someone ran it. No one predicted it would be worth nine figures.

Our take: The science-vs-marketing gap is mostly about stakes, not method. A marketer who skips control groups makes the same logical error as a scientist who skips them. They just lose money instead of lives.

Types of control groups

There are six main types. Most marketers only need to know three.

Not all control groups work the same way. The FDA defines five types for clinical trials, and marketing has borrowed (and adapted) most of them.

TypeScience exampleMarketing exampleWhen to use it
Placebo controlPatients get a sugar pillVisitors see the unchanged pageStandard A/B test
Active controlPatients get the current best drugYou test against your best-performing pageWhen you already know something works
No-treatment controlPatients receive nothingCustomers get no email at all (holdout testing)Measuring if a campaign works at all
Historical controlCompare to past patient dataCompare this month to last monthWeak. Conditions change. Avoid if possible
Positive controlA known-to-work drug verifies the setupAn A/A test confirms your tool is workingValidating your experiment setup
Waitlist controlPatients wait their turn for treatmentProspects on a waitlist vs. immediate accessSaaS trials, product launches

The most common mistake? Treating historical controls as just as good as concurrent ones. Comparing your February numbers to March isn’t a real experiment.

A 2024 meta-analysis of 333 clinical trials found that even the type of control you pick changes the results. Waitlist controls, for example, systematically made treatments look more effective than they actually were.

The type of control group you choose matters almost as much as having one at all.

Control group vs. experimental group

The control group stays unchanged. The experimental group gets the new thing. That’s the whole difference.

This is the most common question in any science class (and honestly, in plenty of marketing meetings too).

Control groupExperimental group
What it receivesNo change. The original.The new version, treatment, or variable.
Its roleBaseline for comparisonTests whether the change makes a difference
Also calledBaseline group, comparison groupTreatment group, test group, variant
In A/B testingVersion A (your current page)Version B (the new version)
How many?Usually oneCan be multiple (B, C, D…)

You can run tests with multiple experimental groups. That’s what multivariate testing does. You test several changes at once, each against the same control.

The difference between A/B testing and multivariate testing comes down to how many things you’re changing at a time. But you typically only need one control group. It’s the anchor.

Control group examples

Here are four real examples, from hospitals to inboxes, showing how control groups work in practice.

Clinical trial

Two hundred patients with the same condition are split randomly. One hundred get the new drug. One hundred get a sugar pill (placebo).

After 12 weeks, doctors compare recovery rates. The sugar pill group is the control. If recovery is 60% in the drug group vs. 30% in the control group, the drug likely works.

Email marketing (holdout)

You have 10,000 subscribers. You send a promotional email to 9,000 of them. The other 1,000 get nothing. That held-back group is your control.

If 4% of the email group buys and only 2% of the holdout group buys, your email drove the difference.

This is how Optimove discovered that a campaign appearing to generate $40,000 in revenue was actually only responsible for $17,100. The other $22,900 would have happened anyway.

Website A/B test

Fifty thousand visitors come to your landing page. Half see the original (control). Half see a version with a new headline (treatment). You track your conversion rate for both.

If the new headline converts at 4.2% vs. the original’s 3.5%, you’ve got a winner.

Tools like Kirro set this up in minutes. You pick the page, make the change in a visual editor, and the tool handles the random split and metrics tracking. No developer needed.

Advertising

Alibaba ran an experiment with 555,800 customers. The treatment group got personalized product recommendations. The control group got generic ones.

Removing personalization caused a 75% drop in click-through rates and an 81% drop in purchases. The control group proved personalization wasn’t just correlated with sales. It caused them.

Why control groups matter

Without a control group, you can’t tell if your change worked or if you just got lucky.

At Microsoft, former technical fellow Ronny Kohavi found that only about one-third of experiments improve their target metric. Another third have no effect. And the last third actually make things worse.

That’s from Trustworthy Online Controlled Experiments, the industry’s reference book on experimentation.

Without a control group, you’d ship all three types equally.

Mastercard’s 2024 State of Business Experimentation report backs this up: 46% of retail ideas don’t break even when properly tested. Almost half. If those ideas had been rolled out without testing, someone would have called them “innovations.”

The control group is what separates “we think this worked” from “we know this worked.”

Google runs 10,000+ experiments per year. Amazon runs 12,000+. Booking.com runs 25,000. The average company? Two to three tests per month.

Companies that test more win more, but only because each test has a control group that catches the losers before they ship.

Even 77% of organizations that run A/B tests on their websites are doing better than most. Only 0.2% of all websites use A/B testing tools at all. The bar is low.

If you want to start testing, it doesn’t have to be complicated. Set up a simple A/B test with your current page as the control, make one change, and see what happens.

Common control group mistakes

The biggest mistakes aren’t about math. They’re about discipline and setup.

Contamination

Your control group accidentally gets exposed to the treatment. Maybe a promotional email gets forwarded to holdout customers. Maybe a feature change leaks through shared infrastructure.

Meta, Microsoft, and Booking.com have reported that unaddressed contamination can reduce measured lift by 30 to 70%. Your test might show a 5% improvement when the real number is 15%, or vice versa.

Too-small control groups

A control group needs enough people to detect real differences. For A/B tests, a 50/50 split gives you the most statistical power (your test’s ability to spot a real difference when one exists).

For marketing holdouts, 5% works for campaigns targeting 10,000+ customers. Smaller campaigns need 10-20%. Running a test with fewer than 1,000 visitors? You’re flipping a coin and calling it data.

Non-random assignment

Putting your best customers in one group and everyone else in another isn’t an experiment. It’s wishful thinking.

Random assignment is what makes the groups comparable. Skip it and your results are meaningless.

Checking results too early

Looking at your test results before you have enough data leads to false positives (Type 1 errors). The numbers look promising on day two, you call a winner, and a week later the effect disappears.

This is called “peeking,” and it’s one of the most common A/B testing mistakes. Wait for your test to reach the sample size set by your minimum detectable effect (the smallest improvement worth detecting).

Comparing “before” to “after” instead of using a simultaneous control

This month’s conversion rate is higher than last month’s. Must be the redesign, right? Not necessarily.

Seasonal traffic, a competitor going offline, a viral social post. Any of these could explain the change. A true control group runs at the same time as your treatment group. That’s the only way to isolate the variable.

Skipping the control entirely

Some marketers skip control groups because they don’t want to “leave money on the table” by not showing the campaign to everyone. But without a control, you never know if the campaign actually worked.

You might be optimizing something that has zero real impact.

The fix is simple. Set up a quick A/B test with a proper control group and let the numbers tell you what’s actually working.

Our take: If you’re not willing to set aside a small group as a control, you’re not really testing. You’re just making changes and hoping.

FAQ

Quick answers to the most common control group questions, from science class to marketing strategy.

What is a control group in an experiment?

A control group is the group in an experiment that receives no new treatment or change. It stays the same while the experimental group gets the thing you’re testing.

By comparing the two, you can figure out whether your change actually caused the result. In website testing, the control group sees your current page (Version A), and the experimental group sees the new version (Version B).

The null hypothesis (your starting assumption that there’s no difference between groups) is what your test tries to disprove.

What is a good example of a control group?

An email holdout is one of the clearest examples. Say you have 10,000 subscribers and you’re running a promotional campaign. Send the email to 9,000 (your treatment group) and hold back 1,000 (your control group).

If 4% of the email group purchases but only 2% of the control group does, you know the email drove about 200 extra sales. Without the holdout, you’d have assumed all 360 purchases were caused by the campaign.

How do you identify a control group?

The control group is the one that doesn’t receive the new treatment or change. In an A/B test, it’s Version A, your current page. In a clinical trial, it’s the placebo group.

Ask yourself: “Which group stays the same?” That’s your control.

What is a control group in science?

A science control group follows the same rules as any other, just with higher stakes. In a clinical trial, the control group might receive a placebo (sugar pill) while the treatment group gets the actual medication.

The FDA requires clinical trials to include a properly designed control group. In laboratory research, the control keeps all variables constant except the one being tested.

Same principle as marketing: isolate the variable, measure both groups, compare.

Can you run an experiment without a control group?

Technically, yes. Synthetic controls let you build an artificial comparison group from historical data. Quasi-experiments compare similar (but not randomly assigned) groups.

Multi-armed bandit testing shifts traffic toward the winning version dynamically, without a fixed control.

Netflix uses geographic experiments when user-level controls aren’t feasible. But these approaches need more data, more assumptions, and more statistical skill.

For most businesses running website tests or evaluating A/B testing alternatives, a simple control group is still the gold standard. Easiest path to results you can trust.

What is the difference between a control group and an experimental group?

The control group receives no change. The experimental group receives the new treatment.

In an A/B test, the control is your current page and the experimental group sees the variation. You can have multiple experimental groups (testing different headlines, images, or layouts), but you typically only need one control. It’s your anchor. Everything else gets measured against it.

Randy Wattilete

Randy Wattilete

CRO expert and founder with nearly a decade running conversion experiments for companies from early-stage startups to global brands. Built programs for Nestlé, felyx, and Storytel. Founder of Kirro (A/B testing).

View all author posts

Try Kirro

Run smarter A/B tests and boost your conversions

Everything. No limits. No surprises.

Get started free