Version: Next

Data Mesh for Unified Data Access

Spice supports data mesh architectures by giving domain teams decentralized, real-time data access through a unified SQL interface. Each team manages its own datasets while Spice federates and accelerates queries across all sources, removing the need for centralized data pipelines.

Why Spice.ai?

Federated SQL Queries: Query disparate sources (PostgreSQL, Databricks, S3, on-premises systems) through a single SQL interface. Domain teams access their own data without relying on a central data team.
Local Acceleration: Materialize domain-specific datasets near applications using CDC-based refresh, delivering low-latency access without copying data into a central warehouse.
Governance: Integrates with Databricks Unity Catalog for role-based access control and credential vendoring, so teams maintain security and compliance without custom infrastructure.
Observability: End-to-end visibility into data flows and query performance across domains, simplifying monitoring and debugging.

Example

An organization runs multiple teams, each owning their data in separate systems — one team in PostgreSQL, another in Databricks, a third in S3. Spice federates all three sources and accelerates frequently queried datasets locally, so any application can query across domains with consistent performance.

Example Configuration

datasets:
  - from: postgres:team_a.customers
    name: customers
    acceleration:
      enabled: true
      engine: duckdb

  - from: databricks:team_b.transactions
    name: transactions
    acceleration:
      enabled: true
      engine: duckdb
      mode: file
      refresh_mode: changes

  - from: s3://team-c-data/reports/
    name: reports
    params:
      file_format: parquet
    acceleration:
      enabled: true

This configuration federates customer data from PostgreSQL, transaction data from Databricks (with CDC refresh), and report data from S3, accelerating all three locally for unified access.

Benefits

Decentralization: Teams own and manage their own data while applications query a single endpoint.
Performance: Local acceleration delivers consistent low-latency queries across all domains.
Governance: Centralized access control without centralized data infrastructure.

Learn More

Federated SQL Queries: Documentation and Federated SQL Query Recipe.
Data Acceleration: Documentation and DuckDB Data Accelerator Recipe.
Observability: Documentation.

Why Spice.ai?​

Example​

Example Configuration​

Benefits​

Learn More​

Why Spice.ai?

Example

Example Configuration

Benefits

Learn More