Director of Demand Generation & Content at Thrive Internet Marketing Agency
Incrementality across Amazon, Walmart, and Instacart works best when measured at the SKU x retailer level using a Synthetic Control approach. Instead of leaning on last-click logic, this method builds a counterfactual for each advertised SKU within each retail network, using a weighted blend of similar SKUs that were not advertised during the same window. Those controls come from the same retailer, same category, similar price bands, seasonality, and historical velocity. Because the comparison lives inside each retail ecosystem, the read reflects true demand lift rather than media proximity to conversion, and it stays insulated from regional quirks that often distort geo tests.

This structure avoids geo bias because it never relies on ZIP codes or regional holdouts where retailer coverage, delivery speed, or store density varies. Every SKU competes against its own synthetic twin drawn from national sales patterns inside the same platform. That makes Amazon results comparable to Walmart or Instacart without forcing artificial geographic splits that never behave the same across networks. The outcome is a clean answer to one question advertisers care about: what sales would have happened anyway for this exact product at this exact retailer.

One example from our agency involved a packaged food brand running always-on sponsored placements across Amazon and Walmart. Last-click reports suggested strong performance on both, yet the synthetic control showed a different story. Amazon delivered a 14 percent incremental lift at the SKU level, while Walmart landed closer to 4 percent, with most volume reflecting baseline demand. Budget allocation changed the following quarter, favoring Amazon for conquesting and Walmart for coverage, and total incremental revenue rose without increasing spend.
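To make the mechanics concrete, here is a minimal sketch of how the synthetic-twin weights might be fit, assuming weekly unit sales for the advertised SKU and a donor pool of non-advertised SKUs from the same retailer and category; the function names and data shapes are illustrative, not the agency's actual pipeline.

```python
# Minimal synthetic-control sketch (illustrative, not the actual production pipeline).
# Assumes weekly unit sales for one advertised SKU and a pool of non-advertised
# "donor" SKUs from the same retailer, category, and price band.
import numpy as np
from scipy.optimize import minimize

def fit_synthetic_control(donor_pre: np.ndarray, target_pre: np.ndarray) -> np.ndarray:
    """Find non-negative donor weights (summing to 1) that best reproduce the
    advertised SKU's pre-campaign sales. donor_pre has shape (weeks, n_donors)."""
    n = donor_pre.shape[1]
    loss = lambda w: np.sum((donor_pre @ w - target_pre) ** 2)
    cons = [{"type": "eq", "fun": lambda w: w.sum() - 1.0}]
    res = minimize(loss, x0=np.full(n, 1.0 / n), bounds=[(0, 1)] * n, constraints=cons)
    return res.x

def incremental_lift(weights: np.ndarray, donor_post: np.ndarray, target_post: np.ndarray):
    """Compare actual campaign-window sales to the synthetic twin's counterfactual."""
    counterfactual = donor_post @ weights            # what the SKU "would have sold"
    lift = target_post.sum() - counterfactual.sum()
    return lift, lift / counterfactual.sum()         # absolute and percent incremental lift
```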
The only way we've found to get real incrementality on retail media is to stop trusting platform-reported last click and force some kind of control into the system. For Amazon, Walmart, and Instacart, we lean on geo splits or time-based holdouts where a portion of demand is intentionally left unexposed, even if it makes people nervous. One test that gave us confidence was pausing retail ads in a handful of matched regions while keeping everything else constant, then watching what happened to total sales, not just attributed sales. What surprised people was that some "top-performing" campaigns barely moved the needle once you removed last-click bias. The big lesson was that incrementality shows up in lift against a baseline, not in a dashboard that's paid to take credit. If you're not willing to let a slice of demand go dark temporarily, you're guessing, not measuring.
In retail media, a big challenge is understanding whether ads actually create new sales, or whether they just take credit for sales that would have happened anyway. Many strategies still rely on "last click" reporting, which gives all the credit to the final ad someone clicks before buying. This often makes results look better than they truly are, especially for brand search ads. To avoid this, we focus on incrementality: would this sale still have happened if we had not shown the ad? We usually use 2 strategies for this:

1. Full Customer Journey
Instead of only looking at the final click, we analyze the full customer journey. Using Amazon Marketing Cloud (AMC), we can connect ad impressions (who saw which ads) and purchase data (who actually bought). This allows us to understand what happened before the purchase, not just the final interaction. We then compare two groups of shoppers:
Group A: Shoppers who first saw an upper-funnel ad (such as Sponsored Display or Streaming TV), later searched for the brand, and then bought the product
Group B: Shoppers who only searched for the brand (and then bought) but did not see those earlier ads
If Group A converts at a higher rate than Group B, it indicates that the earlier ad helped create demand. This approach gives us a much more realistic view of performance than standard ROAS metrics.

2. Using "New-to-Brand" as a Practical Signal
Both Amazon and Walmart provide New-to-Brand (NTB) metrics, which show whether a customer is buying from a brand for the first time. While NTB is not a perfect measure of incrementality, it is a very useful indicator. For example, a campaign may show strong ROAS, but if 90% of sales come from existing customers, it is likely low-incremental. Campaigns that drive a healthy share of new customers are generally much more incremental and valuable for long-term growth.

Hope this helps! Cheers, Moritz
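As a concrete illustration of the Group A vs. Group B comparison in the answer above, here is a minimal pandas sketch; the column names (saw_upper_funnel, searched_brand, purchased, new_to_brand) are hypothetical stand-ins for whatever a clean-room query actually returns, not AMC's own schema.

```python
# Illustrative sketch of the exposed-vs-unexposed comparison (not actual AMC SQL).
# Assumes a dataframe exported from a clean-room query, one row per shopper, with
# hypothetical boolean columns: saw_upper_funnel, searched_brand, purchased, new_to_brand.
import pandas as pd

def compare_groups(df: pd.DataFrame) -> pd.DataFrame:
    searchers = df[df["searched_brand"]]                  # everyone who searched the brand
    group_a = searchers[searchers["saw_upper_funnel"]]    # saw an upper-funnel ad first
    group_b = searchers[~searchers["saw_upper_funnel"]]   # searched without prior exposure
    rows = []
    for name, g in [("A: exposed then searched", group_a), ("B: search only", group_b)]:
        rows.append({
            "group": name,
            "shoppers": len(g),
            "conversion_rate": g["purchased"].mean(),
            "ntb_share": g.loc[g["purchased"], "new_to_brand"].mean(),
        })
    return pd.DataFrame(rows)

# If Group A's conversion_rate is meaningfully higher than Group B's, the upper-funnel
# exposure is likely creating demand rather than just claiming credit for it.
```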
To understand the incrementality of our spend on retail media channels without last-click attribution bias, we conduct geo-holdout or audience-holdout experiments, which means we intentionally avoid showing our ads to a certain area or audience. One example that gave us a strong read was on the Instacart platform. We identified zip codes with similar order volumes and demographics, withheld spend from the control group, and ran our campaign in the other geographies. For two weeks, we measured both sales inside the platform and the total incremental lift in product movement in those geographies using third-party measurement. The incremental increase in the exposed geographies versus stable, flat sales in the holdout group gave us a strong read on the true impact of our ads. Still, this is not the only way we measure attribution: when we combine this data with modeled attribution from the platforms, we get a balanced and fair perspective.
We measure incrementality by removing last-click from the equation entirely and relying on controlled tests—most often geo holdouts and audience splits—run in parallel across Amazon, Walmart, and Instacart. The goal is to compare outcomes where retail media is present versus absent, holding everything else constant. One test that gave us a confident read was a matched-market geo holdout on Amazon and Walmart: we paused retail media in a statistically similar set of DMAs while keeping pricing, promotions, and national media unchanged. We measured lift on total sales (not attributed sales), brand search, and repeat purchase in test vs. control markets over four weeks. The delta told us the true incremental impact, and it was materially lower than last-click reports—but far more reliable for budget decisions. The key is treating retail media like any other upper-to-mid-funnel channel: take the time to prove lift with controls, and use platform attribution only for in-flight optimization rather than letting it claim the credit.
When I started digging seriously into incrementality across retail media networks, I realized pretty quickly that last-click reporting was telling a comforting story, not a true one. Platforms like Amazon, Walmart, and Instacart all do a great job showing you what happened after an ad was clicked, but very little about what would have happened anyway. As a founder, that gap always bothered me because it leads to confident decisions built on shaky ground.

One experiment that really changed how I think about this came from a retail brand we worked with at NerDAI that was scaling aggressively across multiple retail media networks. Instead of asking "which channel drove the sale," we asked a harder question: "what actually moved the needle?" We designed a geo-based holdout test where a small but statistically meaningful set of regions had retail media spend reduced or paused entirely, while everything else stayed the same, including pricing, promotions, and organic placements.

What surprised us was how modest the true incremental lift was compared to what last-click reports showed. Some campaigns that looked incredible on-platform barely moved total sales when we zoomed out. Others, especially upper-funnel sponsored placements, showed strong incremental lift even though their click-through metrics looked average. That was the moment it clicked for me that incrementality is about comparison, not attribution.

The key was discipline. We resisted the urge to optimize mid-test, let the experiment run long enough to smooth out noise, and focused on blended outcomes like total sales lift and new-to-brand penetration. That gave us confidence to reallocate budget based on real impact, not just dashboard performance.

Today, when I talk about measuring incrementality, I always emphasize this: if your test design can't answer "what happens when we don't run the ads," you're not measuring incrementality, you're just redistributing credit. That mindset shift has been one of the most valuable lessons I've carried across retail, ecommerce, and marketplace-driven brands.
For us, the only way to get a confident read on incrementality across retail media networks was to move away from platform-reported attribution and design our own holdout tests. Relying on last-click in Amazon or Walmart almost always overstates impact, especially when you already have strong brand demand. One experiment that worked well was a geo-based holdout on Amazon. We paused sponsored product and display ads in a defined set of low-variance regions for a few weeks while keeping everything else constant, including pricing, inventory, and promotions. We then compared total sales lift, not ad-attributed sales, against matched control regions where ads continued running. The difference between baseline organic sales and exposed sales gave us a much cleaner incrementality signal. In some cases, we found that only about 60-70% of reported ad sales were truly incremental. The biggest lesson was to look at blended outcomes. We focused on total revenue, new-to-brand customers, and category share rather than ROAS alone. My advice is to start small with controlled pauses or geo splits, document assumptions clearly, and repeat tests regularly. Incrementality isn't a one-time answer, it's something you validate continuously as platforms, competition, and demand shift.
Measuring incrementality across Walmart, Amazon, and Instacart without last-click bias means shifting from attribution to experiment design.

The Methodology:
Clean rooms: Use AMC to join impression data with retail sales, identifying new-to-brand shoppers who haven't purchased in 12 months.
Geo testing: Divide regions into test and control groups. This captures "halo effects" and organic cannibalisation that click tracking misses.

Case study:
Design: Recently we ran a 4-week branded keyword holdout. Test: continued bidding on brand terms. Control: paused all branded spend in 10 specific DMAs.
Result: Organic listings recaptured 70% of the lost paid traffic, meaning only 30% of branded ad sales were truly incremental.
Outcome: We reallocated that 70% waste to category-level keywords, which delivered a 15% higher total sales lift by acquiring new customers rather than paying for existing ones.
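The back-of-the-envelope version of that read, assuming the organic recapture rate can be observed directly in the holdout DMAs (the figures below simply mirror the example above and are not real client data):

```python
# Back-of-the-envelope branded-holdout math, mirroring the example above.
# Assumes you can observe paid-attributed branded sales before the pause and
# total branded sales during the pause in the holdout DMAs.
paid_branded_sales_before = 100_000   # weekly paid-attributed branded sales (hypothetical)
organic_recapture_rate = 0.70         # share of those sales that reappeared organically

truly_incremental = paid_branded_sales_before * (1 - organic_recapture_rate)
print(f"Incremental share of branded ad sales: {1 - organic_recapture_rate:.0%}")  # 30%
print(f"Incremental branded revenue per week: {truly_incremental:,.0f}")            # 30,000
```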
When I needed a clean read on incrementality across Amazon and Walmart, I stopped trusting last-click reports and ran a simple geo holdout test. We picked matched cities based on past sales, turned off retail media in 17% of them for four weeks, and kept spend steady in the rest. What surprised me was that Amazon reporting showed a strong lift everywhere, but the holdout markets only dropped 9% in total sales, not the 28% the dashboard implied. On Walmart, the gap was even clearer, with only an 11% difference between exposed and unexposed regions. That contrast gave me confidence about what was truly incremental. The test worked because it compared real buying behavior, not clicks, and it changed internal conversations from "which network gets credit" to "which spend actually moves demand."
Retail media campaigns sometimes looked stronger than they really were if judged only by last-click sales. To measure true incrementality, a simple A/B holdout test worked best. Half of the stores or zip codes received the campaign—Amazon Sponsored Ads or Walmart display—while the other half stayed dark. All other variables, like email and social, stayed the same. One test on Instacart display ads showed that regions exposed to the campaign saw a 19% lift in category sales versus the holdout. Only 7% of that lift could be explained by last-click attribution, showing how far off the prior last-click models had been. The experiment confirmed that controlled holdouts, rather than just tracking clicks, gave a clear read on what advertising actually drove incremental sales, and helped guide where to spend the next marketing dollar.
Incrementality becomes visible once measurement stops running the race of attribution and starts isolating absence. The cleanest read came from geo-based suppression, not model tuning. Matched markets were selected based on historical sales velocity, basket size, and seasonality. Sponsored placements were halted in the test markets for four weeks while budgets remained flat everywhere else. Organic rank, price, and availability were held constant to avoid confounding factors. The comparison was on net revenue change, not click paths. Scale by SEO saw a clear signal: Amazon and Walmart showed lift nineteen percent greater than the suppressed spend, confirming incremental demand, while Instacart showed near-zero lift, with sales shifting channels rather than expanding. That insight changed budget allocation at once. Confidence came from the restraint: no blended dashboards, no probabilistic crediting. The experiment rested on real absence and real dollars. The essential discipline is patience. Short tests favour platforms that cycle fast and penalize slower ones, and a minimum of three weeks is needed for purchase behavior to settle. Incrementality is not hidden. It only disappears from view when everything is running everywhere all at once.
The only way to understand incremental impact is to think like an experimenter. We set up geo or audience-level test and control groups for each network and measure the lift instead of looking at last-click data. For example, on Amazon we used the Brand Lift beta to randomly suppress our sponsored product ads to a fraction of shoppers and compared sales and new-to-brand orders against the exposed cohort. On Instacart we ran a matched-market experiment, turning off our display placements in a handful of ZIP codes while leaving them on in similar ZIP codes; after normalising for baseline sales trends we could see the incremental contribution of Instacart ads. Across networks we also use media mix modelling to triangulate the effect of each channel. The common thread is setting aside a clean control group, aligning KPIs across networks and measuring uplift versus baseline. This avoids over-attributing to last click and gives our team a confident read on true incremental value.
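For the media-mix-modelling piece mentioned above, a minimal regression-style sketch might look like the following, assuming a weekly panel of total sales and spend per network; the column names are hypothetical, and a production MMM would add adstock and saturation terms.

```python
# Minimal media-mix-style triangulation sketch (illustrative only).
# Assumes a weekly dataframe with hypothetical columns: sales, amazon_spend,
# walmart_spend, instacart_spend, and week_of_year as a crude seasonality control.
import pandas as pd
import statsmodels.api as sm

def triangulate(df: pd.DataFrame):
    X = df[["amazon_spend", "walmart_spend", "instacart_spend", "week_of_year"]]
    X = sm.add_constant(X)
    model = sm.OLS(df["sales"], X).fit()
    # Each spend coefficient approximates incremental sales per dollar on that network,
    # which can then be cross-checked against the geo and audience holdout reads.
    return model.params, model.conf_int()
```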
When it comes to measuring true incrementality on Amazon or Walmart, first you need to get past the platform-reported metrics, which are biased toward last-click attribution. Many of the shoppers those ads reach are already in market with high purchase intent, so you need to separate the sales that happen because of your ads from sales that would have happened regardless. You need to prove causation, not correlation. One rock-solid way to do this is a geo-based holdout experiment. For one client, we first scored historic marketplace sales to find a set of statistically similar markets, along with customer profiles for the product. We took half of them and made them the "test" markets, where we ran our retail media campaigns as-is. The other half became the "control" or "holdout" group, where we turned off ad spend altogether for the same product. By holding those markets dark, measuring total sales lift (not ad sales), and comparing it to baseline sales over a 30-day period, we were able to confidently calculate the incremental revenue generated by the campaign. The key is having the data infrastructure that lets you measure total sales at a geo level, cleanly isolating your media spend impact from organic.
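A minimal sketch of that lift calculation, assuming daily total sales (paid plus organic) per market and hypothetical column names, might look like this:

```python
# Minimal sketch of the geo-holdout lift read (illustrative, not the exact client setup).
# Assumes daily total sales per market with hypothetical columns:
# market, group ("test" or "control"), period ("baseline" or "campaign"), sales.
import pandas as pd

def geo_holdout_lift(df: pd.DataFrame) -> float:
    totals = df.groupby(["group", "period"])["sales"].sum()
    # Index each group's campaign-period sales to its own baseline so that
    # pre-existing differences between the matched markets cancel out.
    test_index = totals[("test", "campaign")] / totals[("test", "baseline")]
    control_index = totals[("control", "campaign")] / totals[("control", "baseline")]
    return test_index / control_index - 1.0   # incremental lift attributable to media
```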
We avoid last-click bias by running geo-split holdout tests across retail media networks. One experiment that gave us high confidence was a matched-market test where we paused retail ads (Amazon + Walmart) in a small set of comparable DMAs while keeping pricing, promotions, and distribution identical. We measured baseline organic + direct sales lift versus exposed markets, not ROAS inside the ad platforms. The key was using new-to-brand rate, total category sales, and repeat velocity as primary KPIs, then validating results over 4-6 weeks to smooth promo noise. That read showed true incremental lift was ~25-30% lower than platform-reported ROAS, but far more durable.
We measure incrementality by isolating holdout or geo-split tests rather than relying on platform-reported attribution. One experiment that gave us confidence was pausing retail media in a matched set of regions while keeping pricing, distribution, and promotions constant, then comparing baseline sales lift versus exposed regions. The key lesson was that true incrementality shows up in net new sales and repeat rate, not last-click ROAS, so controlled testing beats attribution models every time.
To get a real sense of incrementality, I don't look at last-click data. I run holdout tests, where we pause retail media in a few spots while keeping spend steady everywhere else. When we did this for Amazon, we saw a clear sales dip in the test markets, which proved the lift was real. It's not a perfect method, but if you want to know if your ads are actually driving extra sales, simple holdouts give you the clearest answer.
I do SEO and PPC. It's always tricky to know if ads are actually bringing in new sales or if people would have bought anyway. So we ran a test. For a local retailer, we shut off ads in specific zip codes and just watched what happened to sales. It made explaining results to clients way easier because we could show the real lift. Even a basic test like this tells you a lot.
Geographic holdouts yielded the clearest signal without relying on platform-reported attribution. During one retail push for Equipoise products, matched DMAs were paired based on previous velocity, household income, and baseline conversion rates. Paid media ran only in the test markets, while the control markets stayed dark for four weeks. Pricing, promotions, and distribution remained the same in both groups. The read was net lift in unit sales and new-to-brand rate, not ROAS. Test markets showed a nine percent lift in units and a six-point increase in new buyers over control. Amazon and Walmart dashboards both overstated impact by close to two times when we looked at last click; the holdout told a quieter, more believable story. Instacart needed a shorter window because of its faster purchase cycles, so the same design ran for fourteen days with tighter matching, and the results still held directionally. The key discipline was resisting platform-level optimization mid-test: no bid changes, no creative swaps. Stability protected the signal. Incrementality becomes apparent when the noise is removed and the comparisons stay fair.
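One way to express that kind of read is a simple difference-in-differences on units and new-to-brand rate; the dataframe layout and column names below are hypothetical, not the actual reporting setup.

```python
# Illustrative difference-in-differences read on units and new-to-brand (NTB) rate.
# Assumes weekly aggregates per DMA with hypothetical columns:
# group ("test"/"control"), period ("pre"/"during"), units, ntb_orders, orders.
import pandas as pd

def did_read(df: pd.DataFrame) -> dict:
    g = df.groupby(["group", "period"]).agg(units=("units", "sum"),
                                            ntb=("ntb_orders", "sum"),
                                            orders=("orders", "sum"))

    def unit_change(group: str) -> float:
        pre, during = g.loc[(group, "pre"), "units"], g.loc[(group, "during"), "units"]
        return (during - pre) / pre

    def ntb_rate(group: str, period: str) -> float:
        return g.loc[(group, period), "ntb"] / g.loc[(group, period), "orders"]

    # Test-vs-control deltas: percent unit lift and NTB percentage-point gain.
    unit_lift = unit_change("test") - unit_change("control")
    ntb_gain = (ntb_rate("test", "during") - ntb_rate("test", "pre")) - \
               (ntb_rate("control", "during") - ntb_rate("control", "pre"))
    return {"incremental_unit_lift": unit_lift, "ntb_point_gain": ntb_gain}
```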
Here's what actually works. We shut off all ad spend in specific zip codes for a month, then compared sales to areas where we were still running ads. That gave us the real story, not the misleading one you get from just counting last-click conversions. If you need to prove your stuff is working, run a test in a few controlled regions. It's a simple setup but the results don't lie.
In our agency, we measure incrementality across retail media networks such as Amazon, Walmart, and Instacart through ad stock decay calibration per RMN. Each network trains shoppers to respond on a different timeline, so impressions do not vanish at the moment of click or purchase. We model a decay curve for each retailer at the SKU level, estimating how long media influence lingers and how fast it fades, then attribute sales only within that realistic window rather than crediting a single touch.

This approach works because Amazon search ads often drive fast response, while Walmart and Instacart show longer consideration tied to replenishment habits. Calibrating decay per RMN respects those differences and strips out inflation that last click reporting loves to claim. It also creates a common measurement language across networks, since every sale gets weighted according to remaining media influence, not proximity to checkout.

One example came from a household essentials client running sponsored media on Amazon and Instacart. Click reports painted both networks as top performers, yet ad stock calibration told a cleaner story. Amazon showed a short decay with an incremental lift near 11 percent, while Instacart revealed a slower fade and a true lift close to 6 percent, guiding smarter spend levels and steadier growth without chasing phantom credit.
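A minimal sketch of that decay calibration, assuming weekly impressions and SKU-level sales per retailer; the decay grid, function names, and selection criterion are illustrative rather than a specific vendor API.

```python
# Illustrative adstock sketch: a geometric decay transform per retailer, with the
# decay rate chosen by how well the adstocked impressions explain SKU-level sales.
import numpy as np

def adstock(impressions: np.ndarray, decay: float) -> np.ndarray:
    """Carry a share of each week's impressions forward into following weeks."""
    out = np.zeros_like(impressions, dtype=float)
    carry = 0.0
    for t, x in enumerate(impressions):
        carry = x + decay * carry
        out[t] = carry
    return out

def calibrate_decay(impressions: np.ndarray, sales: np.ndarray) -> float:
    """Pick the decay rate whose adstocked series correlates best with sales."""
    candidates = np.linspace(0.0, 0.9, 10)
    corrs = [np.corrcoef(adstock(impressions, d), sales)[0, 1] for d in candidates]
    return float(candidates[int(np.argmax(corrs))])

# A fast-responding retailer (search-driven) yields a small decay rate, while a
# replenishment-driven retailer yields a larger one, so each network's sales are
# credited only within its own realistic influence window.
```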