Supermetrics for Databricks

Connect HubSpot to Databricks — CRM Analytics on the Lakehouse

HubSpot logo
HubSpot
via Supermetrics
Databricks logo
Databricks

Load HubSpot CRM deals, contacts, and email campaign data into Databricks for pipeline and marketing analytics.

✓ No setup required ✓ Free 14-day trial ✓ No credit card needed

Why Connect HubSpot to Databricks?

Warehouse your HubSpot data in Databricks for unlimited historical analysis and cross-source SQL.

ML deal scoring with Feature Store

Compute features like deal velocity, touchpoint density, and engagement recency from your HubSpot Delta table, register them in the Databricks Feature Store, and train a deal close prediction model with MLflow — same features for training and serving.

Unity Catalog governance for CRM data

HubSpot contains sensitive deal values and contact information. Unity Catalog lets you apply column masking on PII fields, row filters per business unit, and full data lineage tracking from ingestion through to dashboards.

PySpark + SQL dual interface for pipeline analysis

Sales ops queries pipeline trends with SQL in Databricks SQL Warehouse, while revenue operations builds forecasting models in PySpark notebooks. Both work on the same Delta table — no ETL between analytics and data science.

How to Connect HubSpot to Databricks

Three steps. Under two minutes. Zero code.

  1. 1

    Create a data transfer

    Log into Supermetrics, select your data source and Databricks as your destination.

  2. 2

    Authorize and configure

    Connect your data source account, provide your Databricks workspace URL and access token, choose your catalog and schema, and select the data you want to transfer.

  3. 3

    Set schedule and start transfer

    Choose your refresh frequency (hourly, daily, or weekly) and click Start. Your data begins flowing into Databricks Delta tables automatically.

HubSpot Data Schema in Databricks

Supermetrics creates and maintains clean, typed tables automatically. Here's what your HubSpot data looks like in Databricks.

Column Type Description
date DATE Reporting date
deal_stage STRING Current deal stage
deal_amount DOUBLE Deal value
contacts_created LONG Number of contacts created
email_open_rate DOUBLE Email open rate

Data Freshness & Scheduling

HubSpot data is typically available in Databricks within 3-6 hours of the sync schedule.

What HubSpot Data Can You Pull into Databricks?

Supermetrics gives Databricks access to your full HubSpot reporting data — metrics and dimensions you already know from the HubSpot interface.

Key Metrics

  • Contacts created
  • Deals created
  • Deal amount
  • Deal close rate
  • Closed won deals
  • Amount won
  • Email open rate
  • Email click rate
  • Email bounce rate
  • Form submissions
  • Page views
  • Sessions
  • MQLs
  • SQLs
  • Customer count

Key Dimensions

  • Deal stage
  • Deal pipeline
  • Contact lifecycle stage
  • Contact source
  • Company name
  • Company industry
  • Owner
  • Campaign
  • Email name
  • Form name
  • Landing page
  • Lead status

Why Supermetrics for Databricks?

Purpose-built for marketing data since 2009. 200,000+ companies trust Supermetrics to move 15% of global ad spend into reporting and analytics destinations.

No Vendor Lock-In

Your data lands in Databricks — infrastructure you own and control. Use any BI tool, any transformation layer, any ML platform. If you ever switch providers, your data and dashboards stay with you.

170+ Marketing Data Sources

Purpose-built for marketing data — not a generic ETL tool. Supermetrics covers 99% of metrics and dimensions from each source, with pre-structured tables ready for analysis. No transformation layer required.

Incremental Loading

Only new and updated HubSpot records are transferred on each run — efficient, cost-effective, and fast. Full historical backfill available on demand.

Enterprise-Grade Security

SOC 2 Type II certified. GDPR and CCPA compliant. OAuth authentication with encrypted credentials. Regional data hosting available. Your data is protected end-to-end.

Flat-Rate, Predictable Pricing

Fixed annual pricing regardless of data volume — no per-row charges, no surprise bills during peak campaign seasons. Transfer as much HubSpot data as you need without worrying about cost spikes.

Complete Data Access

Pull every contact, deal, campaign, and custom property from HubSpot. No field restrictions, no record limits — your complete dataset, ready for analysis.

Frequently Asked Questions

How do I connect HubSpot to Databricks with Supermetrics?

Log into the Supermetrics Hub, create a new data transfer, select HubSpot as the source and Databricks as the destination. Authorize your HubSpot account, provide your Databricks workspace URL and access token, choose your catalog, schema, and Unity Catalog settings, select the fields you need, set a schedule, and start the transfer. No custom notebooks, Spark jobs, or Delta Lake plumbing required — Supermetrics writes directly to Delta tables and registers them in Unity Catalog so your data is governed, versioned, and queryable with both SQL and PySpark from the moment it lands.

Is my HubSpot data secure when transferring to Databricks?

Supermetrics is SOC 2 Type II certified and fully GDPR compliant. All HubSpot credentials are encrypted at rest and in transit. Data flows directly from the HubSpot API into your Databricks workspace — Supermetrics never stores your marketing data on its own servers. Unity Catalog provides centralized governance: fine-grained row-level and column-level security, attribute-based access control, and a full audit log of who queried what. Delta Lake's transaction log makes every write atomic and traceable, so you always have a verifiable lineage of your HubSpot data from ingestion to insight.

Can I combine HubSpot data with other sources in Databricks?

That is one of the defining advantages of the Databricks lakehouse architecture. Once HubSpot data lands as a Delta table, you can JOIN it with any other table in your lakehouse — raw event streams, CRM exports, product analytics, even ML Feature Store tables used for model training. Query in SQL from Databricks SQL warehouses or switch to PySpark and pandas for data science workflows — same data, no copying. Supermetrics supports 170+ connectors that all land in the same Unity Catalog namespace, and the Photon engine accelerates analytical queries on those Delta tables automatically.

What HubSpot metrics and dimensions are available in Databricks?

All standard HubSpot reporting fields are available, including Contacts created, Deals created, Deal amount, Deal close rate, Closed won deals, Amount won, and many more. You select exactly which metrics and dimensions to transfer during setup, and you can add or remove fields at any time without losing historical data already stored in your Delta tables. Delta Lake's time travel lets you query any previous version of your HubSpot data — useful for auditing retroactive metric recalculations or reproducing a dashboard state from last quarter. Schema evolution is handled automatically, so new fields appear as columns without breaking existing queries.

How fresh is HubSpot data in Databricks?

Data freshness depends on your transfer schedule. Supermetrics supports hourly, daily, or weekly transfers into Databricks. Most teams schedule daily transfers so yesterday's complete data is available each morning. Delta Lake's MERGE capability ensures only new and changed records are upserted, keeping cluster utilization and storage costs low. For teams that need near-real-time visibility, the Photon engine accelerates incremental queries so dashboards refresh in seconds, and you can set up Databricks SQL alerts to trigger notifications when key HubSpot metrics cross your thresholds.

Which HubSpot hubs are supported?

The connector supports HubSpot Marketing Hub, Sales Hub, and CRM data including contacts, companies, deals, email campaigns, forms, and landing pages.

Does the connector support HubSpot custom properties?

Yes. Custom properties for contacts, companies, and deals are available as fields you can load into Redshift.

Ready to Connect HubSpot to Databricks?

Join 200,000+ companies that use Supermetrics to connect their marketing data. Set up in under two minutes.

✓ SOC 2 Type II certified ✓ GDPR compliant Trust Center