Guides

Identity Resolution for Platforms: Build vs Buy vs Embed

Should your platform build its own identity graph, buy enterprise access, or embed an API? A practical framework for SaaS teams adding visitor identification.

Nicolas Canal Nicolas Canal · · 12 min read
Identity Resolution for Platforms: Build vs Buy vs Embed

You’re building a SaaS platform and your customers keep asking the same question: “Who’s visiting my website?”

Maybe you’re building an AI SDR tool that needs real-time intent signals. Maybe you run an agency dashboard managing dozens of client accounts. Maybe you’re building a sales engagement platform and your users want to trigger sequences based on actual website behavior, not just email opens.

Whatever the case, you’ve got three options for adding visitor identification to your platform:

  1. Build your own identity graph from scratch
  2. Buy enterprise access from a data giant
  3. Embed an existing API into your product

Two of these will cost you a year of engineering time or $300K+ annually. One will take 10 minutes.

Let’s break down the real costs, timelines, and trade-offs so you can make the right call for your platform.


Why Platforms Need Identity Resolution Now

Here’s the thing about identity resolution: if your platform doesn’t provide visitor data, your customers will buy it separately. And once they’re logging into another tool for something your platform could offer, they start wondering what else they’re missing.

Worse, they might leave for a competitor that bundles it.

This isn’t hypothetical. We’re seeing it across four major platform categories:

AI SDR platforms need to know when a prospect visits a client’s pricing page. Without that signal, the AI is just blasting cold outreach with no context. With it, every email becomes a warm touchpoint. That’s the difference between AI sales agents that work and ones that don’t.

Agency tools managing 20, 50, or 200 client websites need centralized identification across all accounts. Logging into separate dashboards per client doesn’t scale. They need an API that powers everything from a single pane of glass.

Sales intelligence platforms aggregate signals to help reps prioritize. Website visits are one of the strongest intent indicators available — stronger than email opens, stronger than ad clicks. If your platform doesn’t surface them, you’re missing the highest-quality signal.

RevOps dashboards connecting sales and marketing data need visitor identification as another enrichment layer. It’s the pipe that connects anonymous traffic to real people in the CRM.

The pattern is clear: identity resolution is becoming table-stakes infrastructure for B2B platforms. It’s no longer a nice-to-have differentiator — it’s a retention lever. When your customers can get visitor identification natively inside your platform, switching costs go up. When they have to bolt it on separately, they start evaluating whether your platform is really the right home base.

The question isn’t whether to add it. It’s how.


Option 1: Build Your Own Identity Graph

The DIY approach. Full control, full complexity, full cost.

What It Actually Takes

Building an identity graph isn’t a weekend project. Here’s what you’re signing up for:

Data licensing — You need raw identity data to match against. That means negotiating deals with data providers, which typically run $200K-$500K/yr depending on coverage and freshness. And you’ll probably need multiple providers to get reasonable match rates.

Matching infrastructure — Raw data isn’t useful until you build the matching logic. Deterministic matching requires cross-referencing IP addresses, cookies, device fingerprints, and behavioral signals against identity records. This is non-trivial engineering.

Compliance — CCPA, GDPR, state-level privacy laws. You’ll need legal counsel specializing in data privacy, consent management infrastructure, and ongoing compliance monitoring. This alone can cost $50K-$100K/yr in legal fees.

Engineering team — Realistically, you’re looking at 4-6 engineers working for 12-18 months to build a functioning identity graph. That’s before maintenance, data quality monitoring, and iterating on match rates.

Ongoing freshness — Identity data goes stale fast. People change jobs, email addresses expire, phone numbers rotate. You need continuous data refreshes and quality monitoring or your match rates will crater within months.

The Real Numbers

┌─────────────────────────────────────────────────────────┐
│              BUILD YOUR OWN: COST BREAKDOWN             │
├─────────────────────────────────────────────────────────┤
│                                                         │
│   Data licensing:          $200K - $500K/yr             │
│   Engineering (build):     $400K - $900K (12-18 mo)     │
│   Legal/compliance:        $50K - $100K/yr              │
│   Infrastructure:          $50K - $100K/yr              │
│   Ongoing maintenance:     $200K+/yr (2-3 FTEs)         │
│                                                         │
│   ─────────────────────────────────────────             │
│   Year 1 total:            $500K - $2M+                 │
│   Annual ongoing:          $200K+/yr                    │
│   Time to MVP:             12-18 months                 │
│                                                         │
└─────────────────────────────────────────────────────────┘

When Building Makes Sense

Honestly? Almost never — unless identity resolution is your core product. If you’re a $100M+ ARR company and identity is your competitive moat, building makes strategic sense. For everyone else, you’re building a company within a company.

We’ve talked to dozens of platform teams who started down the “build” path. The common pattern: six months in, they’ve spent $300K+, their identity graph matches at 5-10% (compared to mature providers at 30-40%), and their core product roadmap has stalled because the engineers building the graph are the same ones who should be shipping features.

Pros:

  • Full control over data and matching logic
  • No vendor dependency
  • Can be a competitive differentiator if identity is your core offering

Cons:

  • Massive upfront investment ($500K-$2M+)
  • 12-18 months before you have anything usable
  • Compliance burden is 100% yours
  • Data licensing is complex and expensive
  • Constant data freshness work or match rates degrade
  • Pulls engineering focus from your actual product

Option 2: Buy Enterprise Access from a Data Giant

The “write a big check” approach. You get access to an established identity graph, but you’ll pay enterprise prices.

Who Sells Enterprise Access?

The usual suspects:

  • LiveRamp — $300K-$500K/yr starting point
  • Oracle Data Cloud — Similar pricing, long procurement
  • Experian — Enterprise identity resolution, complex contracts
  • Acxiom — Data-as-a-service, Fortune 500-oriented

These are serious data companies with massive identity graphs. The data quality can be excellent. But the buying experience is designed for Fortune 500 procurement teams, not growing SaaS platforms.

The Process

Here’s what the timeline typically looks like:

  1. Initial contact — Fill out enterprise form, wait for sales callback (1-2 weeks)
  2. Discovery calls — 3-5 calls to scope requirements (2-4 weeks)
  3. Legal review — Data licensing agreements, DPAs, compliance review (4-8 weeks)
  4. Contract negotiation — Annual minimums, volume tiers, usage restrictions (2-4 weeks)
  5. Technical integration — Dedicated integration team, staging environment (4-8 weeks)
  6. Go live — Finally sending real traffic through (after 6-12 months)

That’s 6-12 months from first call to live data. If you’re a fast-moving startup, that timeline kills you. Your competitor who embedded an API shipped the feature last quarter while you’re still in legal review.

And here’s the part nobody warns you about: enterprise contracts typically come with minimum annual commitments, auto-renewal clauses, and usage floors. If your platform doesn’t grow as fast as projected, you’re still on the hook for the full contract value.

The Real Numbers

┌─────────────────────────────────────────────────────────┐
│              BUY ENTERPRISE: COST BREAKDOWN              │
├─────────────────────────────────────────────────────────┤
│                                                         │
│   Annual license:          $300K - $500K/yr             │
│   Integration costs:       $50K - $100K (one-time)      │
│   Dedicated eng team:      1-2 FTEs for integration     │
│   Legal/procurement:       $10K - $30K                  │
│                                                         │
│   ─────────────────────────────────────────             │
│   Year 1 total:            $350K - $600K+               │
│   Annual ongoing:          $300K - $500K/yr             │
│   Time to live data:       6-12 months                  │
│                                                         │
└─────────────────────────────────────────────────────────┘

When Buying Makes Sense

If you’re already a large enterprise with procurement infrastructure, legal teams, and budgets that don’t flinch at $300K line items, this path works. The data quality from providers like LiveRamp is genuinely good, and you get established compliance frameworks.

But if you’re a Series A startup or a bootstrapped platform? This isn’t built for you.

Pros:

  • Large-scale, established identity data
  • Compliance frameworks already in place
  • Proven data quality at enterprise scale

Cons:

  • $300K-$500K/yr — prohibitive for most growing platforms
  • 6-12 month timeline from first call to live data
  • Rigid contracts with annual commitments
  • Designed for Fortune 500, not growing SaaS companies
  • Integration complexity requires dedicated engineering resources

Option 3: Embed via API

The fast path. Plug into an existing identity resolution API, get visitor data flowing in hours, and focus on building your actual product.

How It Works

Instead of licensing raw data or building matching infrastructure, you embed a provider’s API directly into your platform. Your customers never see the underlying provider — they just see visitor data appearing in your product.

┌─────────────────────────────────────────────────────────┐
│              EMBED API: HOW IT WORKS                    │
├─────────────────────────────────────────────────────────┤
│                                                         │
│   Your Platform                                         │
│       │                                                 │
│       ├── Creates pixel per client (API call)           │
│       │                                                 │
│       ├── Client installs pixel on their site           │
│       │                                                 │
│       ├── Receives visitor data (webhook)               │
│       │                                                 │
│       └── Surfaces data in your UI                      │
│                                                         │
│   Your Client                                           │
│       └── Sees visitor data in YOUR platform            │
│           (never touches the identity provider)         │
│                                                         │
└─────────────────────────────────────────────────────────┘

The identity provider becomes invisible infrastructure — like Stripe for payments or Twilio for messaging. Your customers get the value. You get the credit.

Who Offers Embeddable APIs?

Not many providers have built their APIs with platform embedding in mind. Most visitor identification tools are dashboard-first, with APIs as an afterthought.

Leadpipe — $1,279/mo for the agency/platform plan (20,000 IDs). Full API with 23 endpoints, programmatic pixel minting, real-time webhooks, person-level intent data across 20,000+ topics, ICP filtering, and CSV export. White-label available.

Customers.ai — Custom pricing, API available but less documented. Primarily focused on their own end-user product.

The key difference: Leadpipe’s API was designed for exactly this use case. Programmatic pixel creation, multi-tenant architecture, and webhook-first data delivery. Most competitors bolt on an API as a secondary feature.

The Real Numbers

┌─────────────────────────────────────────────────────────┐
│              EMBED API: COST BREAKDOWN                  │
├─────────────────────────────────────────────────────────┤
│                                                         │
│   Monthly plan:            $1,279/mo ($15,348/yr)       │
│   Engineering time:        Hours to days (0-1 FTE)      │
│   Legal/compliance:        Handled by provider          │
│   Data licensing:          Included                     │
│   Maintenance:             Minimal (API updates)        │
│                                                         │
│   ─────────────────────────────────────────             │
│   Year 1 total:            ~$15K - $19K                 │
│   Annual ongoing:          ~$15K - $19K                 │
│   Time to live data:       Hours to days                │
│                                                         │
└─────────────────────────────────────────────────────────┘

That’s $15K vs $350K-$2M. Not a typo.

When Embedding Makes Sense

For the vast majority of platforms adding visitor identification, embedding an API is the right call. You’re not building an identity company — you’re building a SaaS product that uses identity data. Focus on your core product, not on data licensing negotiations and compliance frameworks.

Pros:

  • Live in hours, not months
  • $15K/yr vs $300K+/yr
  • 0-1 engineers vs 4-6 engineers
  • Compliance handled by the provider
  • White-label available — your clients never see the underlying provider
  • Scales with usage, no rigid contracts

Cons:

  • Dependent on the vendor for data quality and uptime
  • Match rates depend on the provider (Leadpipe: 30-40%+)
  • Less control over matching logic

Try Leadpipe free with 500 leads ->


The Decision Matrix

Here’s the full comparison across every factor that matters:

FactorBuildBuy EnterpriseEmbed API
Cost Year 1$500K-$2M$350K-$600K$15K-$19K
Annual ongoing$200K+/yr$300K-$500K/yr$15K-$19K/yr
Time to market12-18 months6-12 monthsDays
Engineering needed4-6 FTEs1-2 FTEs0-1 FTE
Match rate controlFullProvider-dependentProvider-dependent
Data freshnessYour responsibilityProvider-managedProvider-managed
Compliance burden100% yoursSharedProvider handles
White-labelYesMaybeYes (Leadpipe)
Multi-tenantCustom buildVariesBuilt-in
Intent dataBuild separatelyMaybeYes (Leadpipe)
ScalabilityUnlimited (if built right)Contract-limitedUsage-based
Vendor riskNoneMediumMedium
Best for$100M+ ARR, identity as coreFortune 500Growing platforms

The pattern is pretty clear. Unless identity resolution is literally your product, embedding an API gives you 90% of the value at 5% of the cost.


The “10-Minute” Integration

Let’s get specific. Here’s what the integration actually looks like with Leadpipe’s API:

Step 1: Create a Pixel for Your Client

// POST /v1/data/pixels
const response = await fetch('https://api.leadpipe.com/v1/data/pixels', {
  method: 'POST',
  headers: {
    'Authorization': 'Bearer YOUR_API_KEY',
    'Content-Type': 'application/json'
  },
  body: JSON.stringify({
    name: 'Client: Acme Corp',
    domain: 'acme.com',
    webhook_url: 'https://yourplatform.com/webhooks/leadpipe'
  })
});

const pixel = await response.json();
// Returns pixel ID + JavaScript snippet for the client to install

Step 2: Give the Client Their Pixel Code

The API returns a JavaScript snippet. Your platform can either:

  • Display it in a “Setup” UI for the client to copy/paste
  • Auto-inject it if you control the client’s site (like agency platforms)
  • Provide installation instructions within your app

Step 3: Receive Visitor Data via Webhook

// Your webhook endpoint receives identified visitors in real-time
app.post('/webhooks/leadpipe', (req, res) => {
  const { visitor, session, pixel_id } = req.body;

  // visitor.email, visitor.name, visitor.company, visitor.title
  // session.pages_viewed, session.referrer, session.duration

  // Route to the right client based on pixel_id
  const client = await getClientByPixelId(pixel_id);

  // Do whatever your platform does with this data:
  // - Trigger AI SDR outreach
  // - Update client dashboard
  // - Enrich CRM records
  // - Score account intent

  res.sendStatus(200);
});

Step 4: Query Historical Data via API

// GET /v1/data - pull visitor data for any client pixel
const visitors = await fetch(
  'https://api.leadpipe.com/v1/data?pixel_id=px_123&days=7',
  { headers: { 'Authorization': 'Bearer YOUR_API_KEY' } }
);

That’s it. Four steps, a few hours of engineering, and your platform now has identity resolution as a service.

The webhook-first architecture is what makes this practical for platforms. Instead of polling an API for new identifications (which introduces latency and complexity), you register a webhook URL and Leadpipe pushes data to you the moment a visitor is identified. Your platform reacts in real time — triggering AI outreach, updating dashboards, enriching CRM records — without any polling infrastructure.

For the full step-by-step walkthrough with production-ready code, see: Add Identity Resolution to Your SaaS in 10 Minutes.


Revenue Model: How Platforms Make Money With Embedded Identity

This isn’t just a feature add — it’s a revenue opportunity. Here’s how the math works for platforms reselling visitor identification to their clients.

The Unit Economics

Line ItemAmount
Leadpipe Agency plan$1,279/mo for 20,000 IDs
Cost per identification~$0.064
Typical client charge$500-$3,000/mo
Clients per plan5-20+

Example: 5 Clients at $1,000/month Each

┌─────────────────────────────────────────────────────────┐
│              PLATFORM REVENUE MODEL                     │
├─────────────────────────────────────────────────────────┤
│                                                         │
│   Revenue:  5 clients x $1,000/mo  =  $5,000/mo        │
│   Cost:     Leadpipe Agency plan   =  $1,279/mo        │
│                                                         │
│   ─────────────────────────────────────────             │
│   Profit:                             $3,721/mo         │
│   Annual profit:                      $44,652/yr        │
│   Margin:                             74.4%             │
│                                                         │
└─────────────────────────────────────────────────────────┘

$3,721/month in profit from a $1,279 investment. That’s a 74%+ margin.

And those margins only improve as you add more clients. At 10 clients paying $1,000/mo, you’re looking at $8,721/mo profit (87%+ margin). At 20 clients, you’ve likely upgraded your plan but your margins are still 80%+.

Compare that to traditional agency services where margins typically range from 15-35%. Visitor identification is one of the highest-margin services you can offer.

Pricing Strategies for Platforms

There are a few ways to package this for your clients:

Bundled — Include visitor identification as part of your platform’s core offering. Justifies a higher monthly fee across the board. Works well for AI SDR and sales engagement platforms.

Tiered add-on — Offer visitor identification as an add-on at $500, $1,000, or $2,000/mo tiers based on identification volume. Common for agency platforms and RevOps tools.

Per-lead pricing — Charge clients per identified visitor ($0.10-$0.50 each). Works at high volumes but requires more sophisticated billing infrastructure.

Included in premium tier — Make it a feature that differentiates your premium plan from basic. Drives upgrades without a separate line item.

For a deeper breakdown on the agency model specifically, see: White-Label Visitor Identification.


Case Study: AISDR — 250K Monthly Identifications via API

Theory is nice. Let’s talk about what this looks like in production.

AISDR is an AI-powered sales development platform that automates outbound prospecting. Their AI writes personalized emails, manages sequences, and books meetings — all without human reps.

The Problem

AISDR’s AI was good at writing emails. But without knowing who was actively engaging with a prospect’s website, every outreach was effectively cold. The AI had no way to prioritize prospects showing real buying intent.

Building visitor identification in-house would have meant:

  • Licensing identity data from multiple providers
  • Building pixel infrastructure from scratch
  • Handling privacy compliance across jurisdictions
  • Dedicating 4+ engineers for 12+ months

That’s a company within a company. Not what a fast-moving AI startup should be building.

The Solution

AISDR embedded Leadpipe’s API into their platform. The integration took less than a week.

The Results

┌─────────────────────────────────────────────────────────┐
│              AISDR: RESULTS                             │
├─────────────────────────────────────────────────────────┤
│                                                         │
│   Monthly identifications:     250,000+                 │
│   Integration method:          API + Webhooks           │
│   Deployment time:             < 1 week                 │
│   Uptime:                      99.9%+                   │
│                                                         │
│   Key outcome: AI has real-time website intent          │
│   signals to personalize every outreach message         │
│                                                         │
└─────────────────────────────────────────────────────────┘

Now when a prospect visits a client’s pricing page, reads a case study, or checks out competitors, AISDR’s AI knows about it in real time. That context gets woven into personalized outreach that actually converts.

The difference? Going from “cold email with AI polish” to “warm outreach triggered by real buying signals.”

What’s worth noting: AISDR didn’t hire a single additional engineer for this integration. Their existing backend team handled it alongside regular feature work. The total engineering investment was measured in days, not months — and the feature immediately became one of their strongest selling points for new customers.

For the full technical breakdown of AISDR’s integration, see: Visitor Identification API for OEM Platforms.


How to Choose: A Practical Framework

The decision matrix above gives you the data. But decisions aren’t just about data — they’re about context. Your team size, your runway, your product roadmap, and where identity fits in your strategy all matter.

Still not sure which path is right? Run through these five questions:

1. Is identity resolution your core product?

If yes — Consider building. Your identity graph IS your competitive moat, and you need full control over matching logic, data sources, and freshness.

If no — Don’t build. You’re diverting engineering from what actually makes your platform valuable.

2. What’s your timeline?

If you can wait 12+ months — Building or buying enterprise are options (if budget allows).

If you need it in weeks or days — Embed an API. There’s no shortcut for the other two paths.

3. What’s your budget for this feature?

$500K+ — All three options are on the table.

$50K-$500K — Buy enterprise or embed API.

Under $50K — Embed API. It’s your only realistic option, and honestly, it’s probably the best one anyway.

4. How many clients need it?

1-10 clients — Embed API. The multi-tenant architecture handles this out of the box.

10-100 clients — Embed API. Scale your plan as usage grows.

1,000+ clients — Embed API or consider building if you’re at that scale AND identity is strategic. Even at this scale, many platforms choose to embed — AISDR handles 250K+ identifications/month through the API without issues.

5. Do you need white-label?

If your clients shouldn’t see the identity provider’s brand, you need either the Build path or a provider that supports white-labeling. Leadpipe offers this. Most enterprise data providers do not (or charge extra).

Bottom line: For 95% of platforms, embedding an API is the right answer. It’s faster, cheaper, and lets you focus engineering resources on what makes your platform unique.


Frequently Asked Questions

What match rates should I expect from an embedded API?

Match rates depend on traffic quality, not just the provider. With Leadpipe, expect 30-40%+ match rates on typical B2B traffic — that’s 240-300% more identifications than most competitors in head-to-head tests. Traffic from paid campaigns and high-intent organic search tends to match at the higher end.

Can I white-label the data so my clients don’t see the provider?

Yes, with the right provider. Leadpipe’s agency/platform plan includes full white-label capabilities — your clients see your branding, your dashboard, your domain. The underlying identity infrastructure is completely invisible. See the white-label guide for details on custom branding.

What about compliance? Am I liable for the data?

When you embed via API, the identity provider handles compliance for the data collection and matching process. Leadpipe is CCPA compliant and provides company-level only data for EU visitors (GDPR). You’re still responsible for how you use the data in your platform, but the heaviest compliance burden — data sourcing, consent frameworks, and identity matching — sits with the provider.

Can I switch providers later if I start with an API?

Yes, and that’s one of the advantages of the embed path. Because you’re integrating through a standard API, switching providers means updating your API calls and webhook handlers — not rebuilding an entire identity graph. Most platforms can migrate to a different provider in days. That said, not all APIs are equal — make sure your initial integration is clean and well-abstracted.

How does intent data work through the API?

Leadpipe doesn’t just identify who is visiting — it tracks what they’re researching across the web. The person-level intent data system monitors 20,000+ topics, so your platform can surface signals like “this visitor has been researching CRM migration tools for the past two weeks.” That context is delivered alongside the visitor identification data through the same API and webhooks. For platforms building AI-powered outreach or lead scoring, intent data is what turns a name and email into an actionable lead.


Start Building

If you’ve read this far, you’re probably already leaning toward the API approach. Here’s the fastest path:

  1. Sign up for a free trial500 identified leads, no credit card required
  2. Get your API key — Available immediately in the dashboard
  3. Create your first pixel via API — Follow the 5-minute quickstart
  4. Set up webhooks — Start receiving real-time visitor data
  5. Scale to production — Upgrade to the agency/platform plan when you’re ready

You can go from zero to receiving identified visitors in your platform in a single afternoon. No $300K contracts. No 12-month build cycles. No 6-month procurement processes.

Just an API key and a few lines of code.

Start your free trial — 500 leads, no credit card ->