Data Product Marketplace
A data product marketplace is a discoverable, governed catalog of data products that consumers can search, request access to, and use โ internal (your company's data assets exposed to internal teams) or external (Snowflake Data Marketplace, AWS Data Exchange, where you sell or buy data). External marketplaces created the data-as-product economy: Snowflake's Marketplace alone hosts thousands of providers letting subscribers query third-party data without ETL. Internal marketplaces apply the same UX to your own data โ every dataset has an owner, a SLA, a sample, a freshness indicator, and a 'request access' button. The marketplace pattern is what makes data mesh and data product thinking practically usable.
The Trap
The trap is building a marketplace that's just a renamed catalog. A real marketplace requires (a) productized data assets โ owned, versioned, documented, SLA'd; (b) discoverability that matches how consumers think (search by use case, not just by table name); (c) self-service access with auditable approvals. Most 'internal data marketplaces' are wikis listing tables with stale documentation and no owner. They don't drive consumption because they don't change consumer experience โ finding data is still painful.
What to Do
Before building a marketplace, decide if you have data products to put in it. If your data is mostly raw tables with no owners and no SLAs, the marketplace will be empty (or worse, full of low-quality assets). Start by productizing the top 20 data assets โ assign owners, document semantics, define SLAs โ then expose them in a marketplace UI (DataHub, Atlan, Collibra, or build internal). Track marketplace metrics: search volume, access requests, time-to-first-query for new consumers. If those don't grow, the marketplace isn't working.
Formula
In Practice
Snowflake Data Marketplace launched in 2019 and by 2024 hosted over 2,500 listings from providers like Weather Source, S&P Global, FactSet, and Foursquare. The breakthrough was technical: data subscribers could query providers' data directly via Snowflake Secure Data Sharing โ no ETL, no copies, just live access governed at the source. AWS Data Exchange offers a similar pattern. The external marketplaces normalized the 'data as a product you subscribe to' UX, which has trickled down into internal marketplace patterns at companies like Netflix (with their Metacat catalog), Airbnb (Dataportal), and Lyft (Amundsen).
Pro Tips
- 01
Snowflake Marketplace and AWS Data Exchange are worth understanding even if you're building internally โ their UX (search, sample, subscribe) is the bar for what consumers expect.
- 02
Productize before you marketplace. A marketplace of poorly-owned assets makes data quality WORSE because consumers find and use bad data faster.
- 03
Measure consumer-side metrics, not producer-side metrics. 'Number of datasets in catalog' is vanity. 'Time-to-first-query for new consumers' and 'percent of access requests fulfilled in <24 hours' are real.
Myth vs Reality
Myth
โA marketplace is the same as a data catalogโ
Reality
A catalog is a list of datasets. A marketplace is a list of data products with owners, SLAs, samples, and self-service access. The marketplace pattern is what catalogs evolve into when the underlying datasets are actually productized.
Myth
โBuild it and they will comeโ
Reality
Marketplaces only get used when (a) consumers know about them, (b) the catalog has assets that match real consumer needs, and (c) the access process is genuinely faster than asking the data team. Missing any of those, the marketplace becomes another wiki nobody reads.
Try it
Run the numbers.
Pressure-test the concept against your own knowledge โ answer the challenge or try the live scenario.
Knowledge Check
Your team is building an internal data marketplace. Six months in, the marketplace lists 800 datasets but consumer adoption is flat โ analysts still ask the data team for help finding data. What's the most likely root cause?
Industry benchmarks
Is your number good?
Calibrate against real-world tiers. Use these ranges as targets โ not absolutes.
Time-to-First-Query (New Consumer)
Time from new consumer joining org to first successful query against a marketplace assetElite
< 1 day
Good
1-3 days
Average
1-2 weeks
Poor
> 2 weeks
Source: Hypothetical synthesis from data platform team benchmarks
Real-world cases
Companies that lived this.
Verified narratives with the numbers that prove (or break) the concept.
Snowflake (Snowflake Data Marketplace)
2019-2026
Snowflake launched its Data Marketplace in 2019, built on Secure Data Sharing โ the technical capability that lets one Snowflake account query another's data directly without copies. By 2024, the Marketplace hosted over 2,500 listings spanning weather, financial markets, demographics, marketing data, and more. Subscribers query the data live; providers get usage telemetry and revenue. The Marketplace effectively created the 'data as SaaS' category, normalizing the subscription model for data products.
Launched
2019
Listings (2024)
2,500+
Key Capability
Secure Data Sharing (no ETL)
Marketplaces work when the technical substrate makes consumption frictionless. Snowflake's Secure Data Sharing eliminated the copy/ETL friction that killed earlier 'data marketplace' attempts. The lesson for internal marketplaces: invest in self-service access, not just a catalog UI.
AWS Data Exchange
2019-2026
AWS launched Data Exchange in 2019 to compete in the same data-as-product category. AWS Data Exchange focuses on file-based and API-based data products delivered via S3 and managed entitlements. Providers include Bloomberg, Reuters, and many specialized data vendors. The Exchange demonstrated that the marketplace pattern works across cloud platforms and that data providers benefit from a single distribution channel rather than per-customer integration.
Launched
2019
Delivery Models
S3, API, Redshift, Lake Formation
Notable Providers
Bloomberg, Reuters, Foursquare
External data marketplaces (Snowflake, AWS) define the consumer expectation for what data discovery and access should feel like. Internal marketplaces will be measured against this UX bar.
Related concepts
Keep connecting.
The concepts that orbit this one โ each one sharpens the others.
Beyond the concept
Turn Data Product Marketplace into a live operating decision.
Use this concept as the framing layer, then move into a diagnostic if it maps directly to a current bottleneck.
Typical response time: 24h ยท No retainer required
Turn Data Product Marketplace into a live operating decision.
Use Data Product Marketplace as the framing layer, then move into diagnostics or advisory if this maps directly to a current business bottleneck.