Reverse-ETL for Operational Workflows
Spice serves enriched data from warehouses and data lakes directly to operational systems and applications, eliminating the need for complex ETL pipelines. By federating queries across warehouse and operational data sources and accelerating results locally, Spice turns warehouse data into a live, queryable resource for applications.
Why Spice.ai?​
- Unified Query Layer: Federated SQL queries span Databricks, Snowflake, PostgreSQL, and other sources through a single endpoint, removing the need for separate connectors or pipeline orchestration.
- Real-Time Sync: Change Data Capture (CDC) keeps local replicas in sync with upstream sources, so operational systems always reflect the latest warehouse data without scheduled batch jobs.
- Local Acceleration: Materializes working sets of warehouse data into fast local engines like DuckDB or SQLite, providing sub-millisecond query latency for application workloads.
- Governance: Integrates with Databricks Unity Catalog for role-based access control and credential vendoring across federated sources.
Example​
An application serves customer metrics from a Databricks lakehouse to an internal dashboard, keeping the data locally accelerated for fast queries and automatically refreshed as upstream data changes.
Example Configuration​
datasets:
- from: databricks:catalog.schema.customer_metrics
name: customer_metrics
acceleration:
enabled: true
engine: duckdb
mode: file
refresh_mode: changes
refresh_check_interval: 10s
This configuration federates the customer_metrics table from Databricks and accelerates it locally in DuckDB. The refresh_mode: changes setting uses CDC to sync only changed rows, keeping operational data current. The DuckDB Data Accelerator recipe provides a practical guide to materializing datasets for such workflows.
Benefits​
- Simplicity: No pipeline orchestration or ETL infrastructure to maintain — query warehouse data directly.
- Freshness: CDC-based refresh keeps operational data current without manual sync.
- Performance: Local acceleration delivers low-latency queries for application workloads.
Learn More​
- Federated SQL Queries: Documentation and Federated SQL Query Recipe.
- Data Acceleration: Documentation and DuckDB Data Accelerator Recipe.
- Observability: Documentation.
