Home Data-Driven Thinking Bigger Data Isn’t Always Better

Bigger Data Isn’t Always Better

SHARE:

omarabdalaData-Driven Thinking” is written by members of the media community and contains fresh ideas on the digital revolution in media.

Today’s column is written by Omar Abdala, chief data scientist at Lotame.

In a fight, would you rather be big or smart? Many would choose bigger, but when it comes to big data, more isn’t necessarily better and it’s certainly not the same as smart data.

Marketers plan to double their spending on big data by 2018, a recent CMO survey found. However, companies “underutilize the marketing analytics that they’ve requested,” according to Christine Moorman, a professor at Duke’s Fuqua School of Business. Every day, marketers gather more data than they can use. Vendors sell marketers more data than is useful.

Why is there a gap between collection and utility? In large part, some of problem can be traced to ad tech’s decadelong arms race to overvalue quantity and undervalue quality.

Sell First, Ask About Performance Never

One of the byproducts of an arms race – or, in this case, a data race – is that buyers and sellers become irrational over time. Supply and demand leapfrog each other to dizzying heights of abundance. Eventually, both sides lose sight of their purpose and simply buying or selling more data becomes the goal.

But isn’t the real goal to run the most effective campaign possible?

Campaign efficacy should be the goal for all involved in ad delivery, but it isn’t for today’s data market. In reality, sellers help buyers zero out their budgets because if they don’t, another vendor will. If all data were equally effective and were fully commoditized on an open market, this wouldn’t be so problematic. But not all data is created equal, and so marketers end up spending their entire budget on data of dubious efficacy.

Marketers are in an unenviable position of having to select data with no information about its impact on the campaign at hand or to use anecdotal prior information. How can this be when today’s targeting technology is so advanced?

It’s relatively easy to construct thin, data-driven campaigns where the targeting is excellent. Search and retargeting are examples. But achieving the scale a marketer demands using these resources is a challenge. Where a thin campaign is defined by quality, a media buy at scale typically includes plenty of proxies for these intent signals. Data buyers struggle to select data points that may have been packaged for a variety of purposes and hope that they prove effective for their specific KPI.

Subscribe

AdExchanger Daily

Get our editors’ roundup delivered to your inbox every weekday.

Eventually, the data vendor’s incentive becomes clear: Keep supplying more data to make the campaign deliverable. Over time, that motivates vendors to create more data than their customers need. Worse, CPM data rates can reward selling data regardless of efficacy.

Incorporating Performance Into The Marketplace

Right now, vendors are paid on a CPM basis regardless of how well the data they sell performs. Many vendors make a handsome profit selling data that doesn’t perform any better than the most basic demographic matching. That is a broken model.

A better model is a market where data buyers can reward vendors that add value and punish those that don’t. If a vendor’s data performs on a particular campaign, it should likewise command a premium for that campaign. If another vendor sells data that is well packaged but ubiquitously available, then it should sell at a lower price that reflects its provenance.

Both buyers and sellers need a true market for valuating data, one where price is tied to performance. Such a model would benefit marketers because it would increase the performance of their ad spend. At the same time, vendors would also benefit from a more transparent, performance-driven market because the buyer feedback would incentivize them to collect and sell only the most valuable data, as opposed to simply selling the most data.

In the short run, shifting to a performance-driven data model will most benefit the vendors who currently provide specific data segments close to consumer intent. But soon enough, a performance-driven model that favors buying smart data over big data would benefit all market participants. Data vendors will be able to export more raw and unrefined data points and earn vastly higher returns. Data buyers will be able to purchase effectiveness rather than bundled CPM data.

Follow Lotame (@Lotame) and AdExchanger (@adexchanger) on Twitter.

Must Read

The AI Search Reckoning Is Dismantling Open Web Traffic – And Publishers May Never Recover

Publishers have been losing 20%, 30% and in some cases even as much as 90% of their traffic and revenue over the past year due to the rise of zero-click AI search.

No Waiting for May – CES Is Where The TV Upfront Season Starts 

If any single event can be considered the jumping-off point for TV upfronts, it’s the Consumer Electronics Showcase (CES), which kicks off this week in Las Vegas, Nevada.

Comic: This Is Our Year

Comic: This Is Our Year

It’s been 15 years since this comic first ran in January 2011, and there’s something both quaint and timeless about it. Here’s to more (and more) transparency in 2026, and happy New Year!

Privacy! Commerce! Connected TV! Read all about it. Subscribe to AdExchanger Newsletters

From AI To SPO: The Top 10 AdExchanger Guest Columns Of 2025

The generative AI trend generated endless hot takes this year, but the ad industry also had plenty to say about growing competition between DSPs and SSPs. Here are AdExchanger’s top 10 most popular guest columns of 2025 and why they resonated.

Comic: Season's Beatings

Enjoy this weekly comic strip from AdExchanger.com that highlights the digital advertising ecosystem … 

6 (More) AI Startups Worth Watching

The founders of six AI startups offer insights on the founding journey and what problems their companies are solving.