Spice v2.0-rc.1 (Mar 4, 2026)

March 4, 2026 · 23 min read

Sergei Grebnov

Senior Software Engineer at Spice AI

Announcing the release of Spice v2.0-rc.1! 🚀

v2.0.0-rc.1 is the first release candidate for early testing of v2.0.

Highlights in this release candidate include:

Active-Active Highly-Available Distributed Query that is object-store-native and built on Apache Ballista, with dynamic cluster sizing, distributed ingestion, and cluster observability
Spice Cayenne RC with staged append writes, file-based retention deletes, composite partitioning, and distributed ingestion
DataFusion v52.2.0 Upgrade with sort pushdown, a new merge join, and dynamic filters
DDL Support for CREATE TABLE and DROP TABLE via SQL for Iceberg and Cayenne catalogs
DuckLake Catalog & Data Connector for lakehouse-style data management
GCS Data Connector (Alpha) for Google Cloud Storage
Rust CLI Rewrite for a unified single-binary experience
Dependency upgrades including DuckDB v1.4.4, delta_kernel v0.18.2, and mistral.rs

Spice v2.0 includes several breaking changes. Review the breaking changes section before upgrading.

Distribution Changes

AI/ML support including local LLM/ML model and hosted LLM inference is now included in the default Spice build and image. The separate models build variant has been removed.

With models now included by default, the data-only distribution (without AI/ML support) is only published in nightly builds. Official production-ready data-only distributions are available exclusively through Spice Cloud and the Enterprise release.

A new Network Attached Storage (NAS) distribution with built-in SMB and NFS data connector support is also now available in nightly builds and with Spice.ai Enterprise.

Distribution / Variant	Open Source	Spice Cloud	Enterprise
Default	✅	✅	✅
Data	Nightly only	✅	✅
NAS (SMB + NFS)	Nightly only	❌	✅
Metal (macOS)	✅	✅	✅
CUDA (Linux)	Nightly only	✅	✅
Allocator variants	Nightly only	✅	✅
ODBC connector	Local build only	✅	✅

For more details, see the Distributions documentation.

What's New in v2.0.0-rc.1

Active-Active HA Distributed Query

Distributed Query exits Beta with active-active highly-available object-store-based distributed query.

Distributed query supports two execution modes:

Synchronous: Queries for accelerated datasets are distributed across executors and results are streamed back in real-time. Non-accelerated datasets execute only on the scheduler. Best for interactive queries where low latency is critical.
Asynchronous: Queries are submitted via the new HTTP-only /v1/queries API and results are materialized to object storage for later retrieval. Best for long-running analytical workloads, batch processing, and non-accelerated datasets in distributed mode.

Key improvements:

Dynamic Cluster Sizing: The query planner automatically adjusts parallelism based on the number of active executors in the cluster, ensuring optimal resource utilization as nodes are added or removed.
Distributed Ingestion: Data ingestion for partitioned accelerated tables is now distributed across executor nodes, enabling higher throughput and parallel data loading in cluster mode. Regular (non-partitioned) accelerated tables do not distribute ingestion loads.
Synchronous Execution on Scheduler: /v1/sql and FlightSQL queries now execute synchronously on the scheduler when appropriate, reducing inter-node overhead for queries that don't benefit from distribution.
Faster Failure Detection: Executor heartbeat timeout reduced from 180s to 30s, enabling the cluster to quickly detect and respond to executor failures.
Cluster Observability: New metrics and Grafana dashboard for monitoring distributed query clusters.

Spice Cayenne Improvements

The Spice Cayenne data accelerator exits Beta with significant reliability and performance improvements:

Staged Append Writes: WAL-based staged append writes prevent partial writes and data loss on stream errors. Batches are written to a WAL file before being committed, ensuring atomicity.
File-Based Retention Deletes: Time-based retention now supports file-level deletes for both position-based and primary-key tables, reducing I/O overhead compared to row-level deletion.
Multiple Partition Expressions: Support for composite partitioning with partition_by: [col1, col2] using hierarchical path-like keys (e.g., 2025/10/15).
Distributed Ingestion: Cayenne catalog now supports distributed ingestion across executor nodes in cluster mode, including UPDATE operations.
Improved Robustness: Fixed CDC edge case where DELETE + UPSERT sequences could produce duplicate primary keys across protected snapshots. Improved upsert handling during runtime restarts.

DataFusion v52.2.0 Upgrade

Apache DataFusion has been upgraded to v52.2.0, bringing significant performance improvements, new query features, and enhanced extensibility.

Performance Improvements:

Faster CASE Expressions: Lookup-table-based evaluation for certain CASE expressions avoids repeated evaluation, accelerating common ETL patterns
MIN/MAX Aggregate Dynamic Filters: Queries with MIN/MAX aggregates now create dynamic filters during scan to prune files and rows as tighter bounds are discovered during execution
New Merge Join: Rewritten sort-merge join (SMJ) operator with speedups of three orders of magnitude in pathological cases (e.g., TPC-H Q21: minutes → milliseconds)
Caching Improvements: New statistics cache for file metadata avoids repeatedly recalculating statistics, significantly improving planning time. A prefix-aware list-files cache accelerates evaluating partition predicates for Hive partitioned tables
Improved Hash Join Filter Pushdown: Build-side hash map contents are now passed dynamically to probe-side scans for pruning files, row groups, and individual rows

Major Features:

Sort Pushdown to Scans: Sorts are pushed into data sources, enabling ~30x performance improvement on pre-sorted data with top-K queries. Parquet scans now reverse row group order for DESC queries on ASC-sorted files
TableProvider supports DELETE and UPDATE: New hooks for DELETE and UPDATE statements in the TableProvider trait, enabling Iceberg and Cayenne connectors to implement SQL DELETE and UPDATE operations
More Extensible SQL Planning: New RelationPlanner API for extending SQL planning for FROM clauses, enabling support for vendor-specific SQL dialects

DDL Support for Iceberg and Cayenne

SQL Schema Management: Spice now supports CREATE TABLE and DROP TABLE DDL operations for Iceberg and Cayenne catalogs via FlightSQL and the /v1/sql API. DML validation has been updated for catalog-level writability.

DuckLake Catalog & Data Connector

Lakehouse-Style Data Management: New DuckLake catalog and data connector enable lakehouse-style data management with DuckDB as the metadata catalog and object storage for data files. DuckLake provides ACID transactions, time travel, and schema evolution on top of Parquet files.

GCS Data Connector (Alpha)

Google Cloud Storage Support: New Google Cloud Storage data connector enables federated queries against data stored in GCS buckets, with Iceberg table support.

Rust CLI Rewrite

Unified Single-Binary Experience: The Spice CLI has been completely rewritten from Go to Rust, eliminating the Go dependency and providing a single spice binary built from the same codebase as spiced. This improves startup performance, reduces distribution size, and ensures consistent behavior between CLI and runtime.

Key Features:

Full Feature Parity: All 27+ CLI commands re-implemented in Rust with identical behavior
New spice query Command: Interactive REPL for async queries via the /v1/queries API with multi-line SQL input, spinner progress indicator, Ctrl+C cancellation, and partial query ID matching
--output=json Flag: Machine-readable JSON output for CLI commands, enabling scripting and automation
spice login --output: New output modes (env, json, keychain) for flexible credential management
spice cloud metrics: New command for Spice Cloud deployment metrics

Models Included by Default

Local LLM/ML model inference (via mistral.rs) is now included in the default Spice build. The separate models build variant has been removed. This simplifies installation and ensures all users have access to local AI inference capabilities.

Error Propagation for Dataset and Model Status APIs

The /v1/datasets and /v1/models APIs now return structured error information when a component is in an Error state. The ?status=true query parameter must be passed to retrieve the real-time component status, including the error state and details. Previously, the status field only indicated Error with no further detail. Now, two new fields are included when ?status=true is specified:

error: A structured object with category, type, and code fields for programmatic error handling (e.g. { "category": "dataset", "type": "auth", "code": "dataset.auth" }).
error_message: A human-readable description of why the component entered an error state.

These fields are only present when ?status=true is passed and the component is in an error state.

Example /v1/datasets?status=true response:

[
  {
    "from": "postgres:syncs",
    "name": "daily_journal",
    "replication_enabled": false,
    "acceleration_enabled": true,
    "status": "Ready"
  },
  {
    "from": "databricks:hive_metastore.default.messages",
    "name": "messages",
    "replication_enabled": false,
    "acceleration_enabled": true,
    "status": "Error",
    "error": {
      "category": "dataset",
      "type": "auth",
      "code": "dataset.auth"
    },
    "error_message": "Unable to authenticate with datasource credentials"
  }
]

The spice datasets and spice models CLI commands now include an ERROR column that displays the error message for any component in an error state.

Additional Dependency Upgrades

Dependency	Version
Ballista	v52.0.0
DuckDB	v1.4.4
delta_kernel	v0.18.2
mistral.rs	v0.7.0 (candle fork removed, now uses candle 0.9.2 from crates.io)
Turso (libsql)	v0.4.4
Vortex	Upgraded with CASE-WHEN support
AWS SDK	Multiple crates updated + APN user-agent support

Other Improvements

Spicepod v2 Support: Spicepods now support version v2, and spice init generates spicepod.yaml files with version: v2 by default while maintaining backward compatibility for existing v1 spicepods.
x.ai Models: x.ai models now exclusively use the /v1/responses endpoint with rate limiting support.
HuggingFace Chat Templates: Added support for chat templates in HuggingFace model configurations.
Databricks SQL Dialect: Added Databricks SQL dialect for DataFusion unparser, improving federation query generation.
Snowflake: Added snowflake_private_key parameter for key-pair authentication.
Acceleration Metrics: New rows_written, bytes_written, and dataset_acceleration_size_bytes metrics for acceleration refresh ingestion.
Refresh SQL UDFs: Core scalar UDFs are now enabled in refresh SQL expressions.
FlightSQL: Fixed TLS connection handling for grpc+tls:// endpoints with custom CA certificate support.
FlightSQL: Fixed schema consistency by expanding view types and verifying field names.
Hash Index: Fixed query correctness when hash index is used with additional filters.
Results Cache: Fixed schema preservation for empty query results.
Query Nullability: Reconciled execution stream nullability with logical plan schema.
Schema Evolution: Graceful handling of schema evolution mismatch errors during data refresh.
Internal YAML Parser: Replaced deprecated serde_yaml with an internal YAML implementation.

Spicepod v1 to v2 Changes

Spicepod v2 introduces configuration improvements while maintaining backward compatibility with v1. Existing v1 spicepods continue to work — deprecated fields are automatically migrated at load time.

Version support:

Version	Status
`v2`	Default. Used by `spice init`.
`v1`	Supported. Deprecated fields auto-migrate.
`v1beta1`	Removed. No longer accepted.

Configuration changes:

v1 (deprecated)	v2 (preferred)	Notes
`runtime.results_cache`	`runtime.caching.sql_results`	All fields migrate automatically. `cache_max_size` → `max_size`.
`runtime.memory_limit`	`runtime.query.memory_limit`	Auto-migrated. `query.memory_limit` takes priority if both set.
`runtime.temp_directory`	`runtime.query.temp_directory`	Auto-migrated. `query.temp_directory` takes priority if both set.
`dataset.invalid_type_action`	`dataset.unsupported_type_action`	Auto-migrated. v2 adds a new `string` variant.

New v2 fields:

runtime.ready_state — Controls when the runtime reports ready (on_load default, or on_registration).
runtime.flight.do_put_rate_limit_enabled — Enable/disable FlightSQL DoPut rate limiting (default: true).
runtime.query.spill_compression — Compression for query spill files (e.g., lz4_frame).
runtime.scheduler.partition_management — Configure partition assignment interval, limits, and timeouts for distributed mode.
runtime.caching.sql_results.stale_while_revalidate_ttl — Serve stale cached results while revalidating in the background.
runtime.caching.sql_results.encoding — Cache entry compression (e.g., zstd).
catalog.access: read_write_create — New access mode for catalogs that support DDL operations.

Migration note: When both the deprecated v1 field and its v2 equivalent are set, the v2 field takes priority.

Contributors

Breaking Changes

Cayenne and Distributed Query exit Beta: Beta warnings have been removed from documentation and code. Both features are now considered GA-ready.
Models included by default: The separate models build variant has been removed. Local LLM inference is now always included.
Spicepod version defaults to v2: New spicepods created with spice init now default to version: v2. Existing v1 spicepods remain supported, and v1beta1 is no longer accepted.
Windows native builds removed: Native Windows builds are no longer provided. Use WSL for local development instead.
Metric renames: accelerated_refresh metrics renamed to acceleration_refresh for consistency. last_refresh_time gauge renamed to include milliseconds unit.
Caching config renamed: ResultsCache replaced with SQLResultsCacheConfig in configuration.
DuckDB parameter rename: partitioned_write_flush_threshold renamed to partitioned_write_flush_threshold_rows.
v1/search API: The /v1/search API now always returns an array in matches, even for single results.
x.ai model endpoint: x.ai models now exclusively use the /v1/responses endpoint.
Error messages: Error messages across S3 Vectors, ScyllaDB, Snowflake, ClickHouse, and other components have been refactored for clarity and consistency.

Cookbook Updates

New and updated Spice Cookbook recipes:

Async Queries: Submit long-running queries asynchronously and retrieve results later.
DuckLake Catalog Connector: Use DuckLake for lakehouse-style data management with ACID transactions and time travel.

The Spice Cookbook includes 88 recipes to help you get started with Spice quickly and easily.

Upgrading

To upgrade to v2.0.0-rc.1, use one of the following methods:

CLI:

spice upgrade

Homebrew:

brew upgrade spiceai/spiceai/spice

Docker:

Pull the spiceai/spiceai:2.0.0-rc.1 image:

docker pull spiceai/spiceai:2.0.0-rc.1

For available tags, see DockerHub.

Helm:

helm repo update
helm upgrade spiceai spiceai/spiceai --version 2.0.0-rc.1

AWS Marketplace:

Spice is available in the AWS Marketplace.

What's Changed

Changelog

Add TPC-DS integration tests with S3 source and PostgreSQL acceleration by @phillipleblanc in #9006
fix(tests): fix flaky/slow/failing unit tests by @phillipleblanc in #9009
fix: Update benchmark snapshots for DF51 upgrade by @app/github-actions in #9008
fix: add feature gate to rrf TEST_EMBEDDING_MODEL by @phillipleblanc in #9017
fix: features check by @phillipleblanc in #9014
fix: Enable Cayenne acceleration snapshots by @lukekim in #9020
URL table support by @lukekim in #9018
ScyllaDB key filter by @lukekim in #8997
fix: Schema mismatch when using column projection with HTTP caching by @phillipleblanc in #9021
Add more tests for HTTP caching with columns selection by @sgrebnov in #9025
HTTP cache snapshots: default to time_interval and fix snapshots_creation_policy: on_change by @sgrebnov in #9026
Fix duplicate snapshot creation on startup by @sgrebnov in #9029
Add ScyllaDB and SMB to the README table by @krinart in #9034
Remove waiting for runtime to be ready before creating snapshot by @krinart in #9033
Fix snapshot on_change policy to skip when no writes occurred by @sgrebnov in #9028
Release notes for release release/1.11.0-rc.2 by @krinart in #9016
ci: use arduino/setup-protoc for official protobuf compiler by @phillipleblanc in #9036
ci: install unzip on aarch64 runner for arduino/setup-protoc by @phillipleblanc in #9038
fix: don't fail release if upload to minio fails by @phillipleblanc in #9039
Add missing protoc step to setup-cc action by @krinart in #9041
fix: Update Search integration test snapshots by @app/github-actions in #9013
Fix formula_1 and codebase_community in bird-bench by @Jeadie in #9000
Cayenne S3 Express One Zone improvements by @lukekim in #9015
Add zlib1g-dev to CI by @lukekim in #9052
Improve validation and logging for hash indexes by @lukekim in #9047
Upgrade Vortex with CASE-WHEN by @lukekim in #9051
x.ai models now exclusively use /v1/responses endpoint by @lukekim in #9400
Improvements for snapshot schema comparison by @krinart in #9401
v2.0 breaking changes by @lukekim in #9233
Create PartitionManagementTask for scheduler to update accelerated table partition assignments by @Jeadie in #9378
refactor(Cayenne): route all write orchestration through CayenneDataSink by @sgrebnov in #9402
Refactor benchmark to use QueryExecutor trait by @Jeadie in #9418
feat: Add spidapter build and release workflow by @peasee in #9427
Testoperator: add support for api-key when connecting to external spice instance by @sgrebnov in #9421
Initial implementation of Ducklake catalog & data connectors by @lukekim in #9083
Require aws_lc_rs since jsonwebtoken upgrade by @Jeadie in #9426
feat: Add spidapter tool by @peasee in #9425
Add release notes for 1.11.2 patch release by @sgrebnov in #9430
feat(spidapter): integrate system-adapter-protocol with SCP provisioning by @phillipleblanc in #9434
Add DuckLake TPCH E2E workflow and federated Spicepod configuration by @lukekim in #9431
fix(spidapter): use Flight handshake auth instead of x-api-key header by @phillipleblanc in #9435
[spidapter] Keep only what sparks joy by @Jeadie in #9439
Refactor binary operator balancing by @Jeadie in #9424
feat: Add Iceberg DDL support (CREATE TABLE / DROP TABLE) for default catalog override by @phillipleblanc in #9440
Fix Flight SQL schema consistency: expand view types and verify field names by @sgrebnov in #9438
Update spidapter for new system-adapter-protocol by @sgrebnov in #9442
docs: fix typos and syntax errors in style guide and error handling docs by @cluster2600 in #9445
Add acceleration refresh ingestion metrics (rows_written, bytes_written) by @phillipleblanc in #9461
Refactor(Cayenne): Replace CatalogError and string based errors with Snafu errors by @sgrebnov in #9403
Replace deprecated claude-3-5-haiku-latest with claude-haiku-4-5 by @Jeadie in #9492
Fix #9481: Preserve schema in results cache for empty query results by @phillipleblanc in #9485
Fix partition by serializing by @Jeadie in #9474
query: reconcile execution stream nullability with logical plan schema by @phillipleblanc in #9486
initial spice-cloud-client crate and spice cloud metrics --app <app-name>. by @Jeadie in #9480
feat: Return dataset error message in datasets API by @peasee in #9487
Spicebench by @lukekim in #9447
build(deps): consolidate dependabot dependency updates by @phillipleblanc in #9504
fix(cluster): route non-partitioned accelerated tables in distributed mode by @phillipleblanc in #9508
Enable core scalar UDFs in refresh SQL by @sgrebnov in #9502
Fix metrics in Spidapter again by @Jeadie in #9497
fix(cluster): tolerate Completed->status propagation race in distributed query handle by @phillipleblanc in #9510
feat: Support distributed ingestion in cayenne catalog by @peasee in #9506
Fix Cayenne duplicate primary keys after DELETE + UPSERT CDC sequences by @krinart in #9494
fix(cluster): rewrite table scans inside subqueries for distributed execution by @phillipleblanc in #9518
fix: Set catalog mode to readwritecreate in spidapter by @peasee in #9519
Upgrade AWS SDK crates & set APN user-agent in AWS SDK credential bridge by @lukekim in #8328
feat(runtime): add runtime ready_state on_registration semantics by @lukekim in #9522
fix: Add spidapter post-setup retries by @peasee in #9526
Make partition discovery more robust and make initialization non-blocking by @sgrebnov in #9499
Make lint-rust-fix support targeted packages and features by @Jeadie in #9511
Handle new Cloud SCP API by @Jeadie in #9532
Refactor and simplify streaming benchmarks by @krinart in #9405
fix: ensure spidapter only increments attempts on failures by @peasee in #9534
feat: Support specifying app resources in spidapter by @peasee in #9536
test(runtime): Spice Cayenne DDL integration test by @lukekim in #9535
fix: Handle schema evolution mismatch errors during data refresh by @lukekim in #9527
fix: resolve clippy lint warnings by @phillipleblanc in #9547
pr-builds --tag <TAG> for build_and_release.yml by @Jeadie in #9507
Add --output flag to spice login with env/json/keychain modes by @Jeadie in #9541
Don't use 'PartitionedTableScanRewrite' in async distributed query by @Jeadie in #9548
feat(spidapter): add local backend mode with single executor by @phillipleblanc in #9531
support chat template in HF by @Jeadie in #9543
fix(cayenne): stream PK retention deletes and run OOM regression in CI by @phillipleblanc in #9533
cayenne: Staged append writes to prevent partial writes and data loss on stream error by @sgrebnov in #9491
AcceleratedTable::scan use FederatedTable::scan when ClusterRole::Scheduler by @Jeadie in #9550
Upgrade to delta-kernel-rs v0.18.2 by @lukekim in #9528
Run cayenne tests as part of PR CI by @sgrebnov in #9554
Upgrade to DataFusion v52.2.0 by @lukekim in #9419
Remove Snapshot Compaction + Add snapshot existence check by @krinart in #9523
Update dependencies by @lukekim in #9566
fix: Update benchmark snapshots by @app/github-actions in #9565
fix: Compare Cayenne table configuration on startup by @peasee in #9529
Make Refresh::refresh_sql more robust to alterations over time. by @Jeadie in #9549
fix: Update datafusion-table-providers dependency to latest revision by @lukekim in #9574
Unset AWS_ENDPOINT_URL when empty by @krinart in #9575
fix: allow BytesProcessedExec repartitioning for unordered input by @lukekim in #9540
Sanitize DataFusion errors by @lukekim in #9530
Add conditional logging for partition assignments by @Jeadie in #9577
use 'properly early exit on SIGTERM' by @Jeadie in #9573
Update datafusion to 52.2.0 by @phillipleblanc in #9582
Ensure we query one and only one partition per request by @Jeadie in #9416
feat: Add support for Spicepod version v2 by @lukekim in #9583
[SpiceDQ] Improve error messages; Avoid race condition on allocate_initial_partitions. by @Jeadie in #9579
Update ballista dependencies to latest 52.0.0 revision by @lukekim in #9581
Fix Databricks spark_connect mode always disabled by @phillipleblanc in #9586
Support partitioning in Arrow accelerator by @Jeadie in #9571
Fix spice query CLI response deserialization by @phillipleblanc in #9588
fix: Update benchmark snapshots by @app/github-actions in #9584
fix: Share RuntimeEnv across Cayenne read/write/delete paths for targeted list_files_cache invalidation by @sgrebnov in #9589
feat: Add file:// state_location support for async queries scheduler by @phillipleblanc in #9590
Update endgame links by @krinart in #9598

Full Changelog: https://github.com/spiceai/spiceai/compare/v1.11.2...v2.0.0-rc.1

Distribution Changes​

What's New in v2.0.0-rc.1​

Active-Active HA Distributed Query​

Spice Cayenne Improvements​

DataFusion v52.2.0 Upgrade​

DDL Support for Iceberg and Cayenne​

DuckLake Catalog & Data Connector​

GCS Data Connector (Alpha)​

Rust CLI Rewrite​

Models Included by Default​

Error Propagation for Dataset and Model Status APIs​

Additional Dependency Upgrades​

Other Improvements​

Spicepod v1 to v2 Changes​

Contributors​

Breaking Changes​

Cookbook Updates​

Upgrading​

What's Changed​

Changelog​