4 posts tagged with "caching"

Results Cache related topics and usage

Spice v1.10.2 (Dec 22, 2025)

December 23, 2025 · 5 min read

Senior Software Engineer at Spice AI

Announcing the release of Spice v1.10.2! 🔥

v1.10.2 introduces Tiered Caching Acceleration with Localpod for multi-layer acceleration architectures, Periodic Acceleration Snapshots with configurable intervals, DynamoDB JSON Nesting for column consolidation, and Kafka/Debezium Batching for faster data ingestion. This release also includes fixes for SQLite accelerator decimal/date handling and real-time status reporting for the /v1/datasets and /v1/models API endpoints.

What's New in v1.10.2

Tiered Caching with Localpod

Multi-Layer Acceleration Architecture: The Localpod connector now supports caching refresh mode, enabling tiered acceleration where a persistent cache (e.g., file-mode DuckDB) feeds a fast in-memory cache (e.g., Arrow, memory-mode DuckDB).

Key Features:

Automatic Cache Propagation: New cache entries automatically propagate from parent to child accelerators
Warm Startup: Child accelerators initialize from existing parent data on startup, eliminating cold-start latency
Flexible Tiering: Combine any accelerator engines (DuckDB, SQLite, Cayenne) across tiers

Example spicepod.yaml configuration:

datasets:
  # Parent: persistent file-mode cache
  - from: https://api.example.com
    name: api_cache
    acceleration:
      enabled: true
      refresh_mode: caching
      engine: duckdb
      mode: file

  # Child: fast in-memory cache fed by parent
  - from: localpod:api_cache
    name: api_cache_memory
    acceleration:
      enabled: true
      refresh_mode: caching
      engine: arrow
      mode: memory

For more details, refer to the Localpod Data Connector Documentation.

Periodic Acceleration Snapshots

Configurable Snapshot Intervals: A new snapshots_create_interval parameter enables periodic snapshot creation for accelerated datasets across all refresh modes. This provides better control over snapshot frequency and ensures consistent recovery points for accelerated data.

Example spicepod.yaml configuration:

datasets:
  - from: s3://my-bucket/data.parquet
    name: my_data
    acceleration:
      enabled: true
      engine: duckdb
      mode: file
      refresh_mode: caching
      snapshots: enabled
      params:
        snapshots_create_interval: 60s # Write a snapshot every 60 seconds

For more details, refer to the Data Acceleration Documentation.

DynamoDB JSON Nesting

Consolidate Columns into JSON: The DynamoDB Data Connector now supports consolidating columns into a single JSON column using the json_object: "*" metadata option. This is useful when only a few columns are needed as discrete fields while the rest can be accessed as nested JSON.

Example spicepod.yaml configuration:

datasets:
  - from: dynamodb:my_table
    name: my_table
    columns:
      - name: PK
      - name: SK
      - name: data_json
        metadata:
          json_object: '*' # Captures all other columns as JSON

Example Output: Given a DynamoDB table with columns PK, SK, name, email, and status, the resulting table schema consolidates all non-specified columns into the data_json column:

PK	SK	data_json
pk_1	sort_1	`{"name": "Alice", "email": "[email protected]", "status": "active"}`
pk_2	sort_2	`{"name": "Bob", "email": "[email protected]", "status": "inactive"}`

For more details, refer to the DynamoDB JSON Nesting Documentation.

Kafka/Debezium Batching

Faster Data Ingestion: Configure message batching for Kafka and Debezium connectors to improve data ingestion throughput. Batching reduces processing overhead by grouping multiple messages together before insertion.

Key Features:

Configurable Batch Size: Control the maximum number of records per batch (default: 10,000)
Configurable Batch Duration: Set the maximum wait time before flushing a partial batch (default: 1s)

Example spicepod.yaml configuration:

datasets:
  - from: debezium:kafka-server.public.my_table
    name: my_table
    params:
      batch_max_size: 10000 # Max records per batch (default: 10000)
      batch_max_duration: 1s # Max wait time per batch (default: 1s)

For more details, refer to the Kafka Data Connector Documentation and Debezium Data Connector Documentation.

Additional Improvements & Bug Fixes

Reliability: Fixed SQLite accelerator decimal and date type handling for improved data type accuracy.
Reliability: Fixed real-time status reporting for /v1/datasets and /v1/models API endpoints.
Reliability: Fixed Kafka warning when security.protocol is set to PLAINTEXT.

Contributors

Breaking Changes

No breaking changes.

Cookbook Updates

New Cayenne Data Accelerator Recipe: New recipe demonstrating how to accelerate a local copy of the taxi trips dataset using Cayenne as the data accelerator engine. See Cayenne Data Accelerator Recipe for details.

New Dataset Partitioning Recipe: New recipe demonstrating how to partition accelerated datasets to improve query performance. See Dataset Partitioning for details.

The Spice Cookbook includes 84 recipes to help you get started with Spice quickly and easily.

Upgrading

To upgrade to v1.10.2, use one of the following methods:

CLI:

spice upgrade

Homebrew:

brew upgrade spiceai/spiceai/spice

Docker:

Pull the spiceai/spiceai:1.10.2 image:

docker pull spiceai/spiceai:1.10.2

For available tags, see DockerHub.

Helm:

helm repo update
helm upgrade spiceai spiceai/spiceai

AWS Marketplace:

🎉 Spice is now available in the AWS Marketplace!

What's Changed

Changelog

Fix kafka warning when security.protocol is set to PLAINTEXT by @krinart in #8587
fix: SQLite accelerator decimal/date handling by @phillipleblanc in #8606
feat: Enable localpod with caching mode accelerator for tiered caching by @phillipleblanc in #8621
Remove the clippy::too_many_lines lint by @phillipleblanc in #8549
Add snapshot interval for acceleration snapshots by @phillipleblanc in #8627
Json Nesting for DynamoDB by @krinart in #8623
Implement batching for Kafka/Debezium + null Decimal handling by @krinart in #8622
fix: Status field in /v1/datasets & /v1/models by @lukekim in #8633

Spice v1.10.0 (Dec 9, 2025)

December 9, 2025 · 18 min read

William Croxson

Senior Software Engineer at Spice AI

Announcing the release of Spice v1.10.0! ⚡

Spice v1.10.0 introduces a new Caching Acceleration Mode with stale-while-revalidate (SWR) semantics for disk-persisted, low-latency queries with background refresh. This release also adds the TinyLFU eviction policy for the SQL results cache, a preview of the DynamoDB Streams connector for real-time CDC, S3 location predicate pruning for faster partitioned queries, improved distributed query execution, and multiple security hardening improvements.

What's New in v1.10.0

Caching Acceleration Mode

Low-Latency Queries with Background Refresh: This release introduces a new caching acceleration mode that implements the stale-while-revalidate (SWR) pattern. Queries return cached results immediately while data refreshes asynchronously in the background, eliminating query latency spikes during refresh cycles. Cached data persists to disk using DuckDB, SQLite, or Cayenne file modes.

Key Features:

Stale-While-Revalidate (SWR): Returns cached data immediately while refreshing in the background, reducing query latency
Disk Persistence: Cached results persist across restarts using DuckDB, SQLite, or Cayenne file modes
Configurable Refresh: Control refresh intervals with refresh_check_interval to balance freshness and source load

Recommendation: Use retention configuration with caching acceleration to ensure stale data is cleaned up over time.

Example spicepod.yaml configuration:

datasets:
  - from: http://localhost:7400
    name: cached_data
    time_column: fetched_at
    acceleration:
      enabled: true
      engine: duckdb
      mode: file # Persist cache to disk
      refresh_mode: caching
      refresh_check_interval: 10m
      retention_check_enabled: true
      retention_period: 24h
      retention_check_interval: 1h

For more details, refer to the Data Acceleration Documentation.

TinyLFU Cache Eviction Policy

Higher Cache Hit Rates for SQL Results Cache: A new TinyLFU cache eviction policy is now available for the SQL results cache. TinyLFU is a probabilistic cache admission policy that maintains higher hit rates than LRU while keeping memory usage predictable, making it ideal for workloads with varying query frequency patterns.

Example spicepod.yaml configuration:

runtime:
  caching:
    sql_results:
      enabled: true
      eviction_policy: tiny_lfu # default: lru

For more details, refer to the Caching Documentation and the Moka TinyLFU Documentation for details of the algorithm.

DynamoDB Streams Data Connector (Preview)

Real-Time Change Data Capture for DynamoDB: The DynamoDB connector now integrates with DynamoDB Streams for real-time change data capture (CDC). This enables continuous synchronization of DynamoDB table changes into Spice for real-time query, search, and LLM-inference.

Key Features:

Real-Time CDC: Automatically captures inserts, updates, and deletes from DynamoDB tables as they occur
Table Bootstrapping: Performs an initial full table scan before streaming changes, ensuring complete data consistency
Acceleration Integration: Works with refresh_mode: changes to incrementally update accelerated datasets

Note: DynamoDB Streams must be enabled on your DynamoDB table. This feature is in preview.

Example spicepod.yaml configuration:

datasets:
  - from: dynamodb:my_table
    name: orders_stream
    acceleration:
      enabled: true
      refresh_mode: changes # Enable Streams capture

For more details, refer to the DynamoDB Connector Documentation.

OpenTelemetry Metrics Exporter

Spice can now push metrics to an OpenTelemetry collector, enabling integration with platforms such as Jaeger, New Relic, Honeycomb, and other OpenTelemetry-compatible backends.

Key Features:

Protocol Support: Supports the gRPC (default port 4317) protocol
Configurable Push Interval: Control how frequently metrics are pushed to the collector

Example spicepod.yaml configuration for gRPC:

runtime:
  telemetry:
    enabled: true
    otel_exporter:
      endpoint: 'localhost:4317'
      push_interval: '30s'

For more details, refer to the Observability & Monitoring Documentation.

S3 Connector Improvements

S3 Location Predicate Pruning: The S3 data connector now supports location-based predicate pruning, dramatically reducing data scanned by pushing down location filter predicates to S3 listing operations. For partitioned datasets (e.g., year=2025/month=12/), Spice now skips listing irrelevant partitions entirely, significantly reducing query latency and S3 API costs.

AWS S3 Tables Write Support: Full read/write capability for AWS S3 Tables, enabling direct integration with AWS's managed table format for S3. Use standard SQL INSERT INTO to write data.

For more details, refer to the S3 Data Connector Documentation and Glue Data Connector Documentation.

Faster Distributed Query Execution

Distributed query planning and execution have been significantly improved:

Fixed executor registration in cluster mode for more reliable distributed deployments
Improved hostname resolution for Flight server binding, enabling better executor discovery
Distributed accelerator registration: Data accelerators now properly register in distributed mode
Optimized query planning: DistributeFileScanOptimizer improvements for faster planning with large datasets

For more details, refer to the Distributed Query Documentation.

Search Improvements

Search capabilities have been improved with several performance and reliability enhancements:

Fixed FTS query blocking: Full-text search queries no longer block unnecessarily, improving query responsiveness
Optimized vector index operations: Eliminated unnecessary list_vectors calls for better performance
Improved limit pushdown: IndexerExec now properly handles limit pushdown for more efficient searches

For more details, refer to the Search Documentation.

Security Hardening

Multiple security improvements have been implemented:

SQL Identifier Quoting: Hardened SQL identifier quoting across all database connectors (PostgreSQL, MySQL, DuckDB, etc.) to prevent SQL injection attacks through table or column names
Token Redaction: Sensitive authentication tokens are now fully redacted in debug and error output, preventing accidental credential exposure in logs
Path Traversal Prevention: Fixed tar extraction operations to prevent directory traversal vulnerabilities when processing archived files
Input Sanitization: Added strict validation for top_n_sample order_by clause parsing to prevent injection attacks
Glue Credential Handling: Prevented automatic loading of AWS credentials from environment in Glue connector, ensuring explicit credential configuration

Developer Experience Improvements

Health probe metrics: Added health probe latency metrics for better observability
CLI improvements: Fixed .clear history command in the REPL to fully clear persisted history

Contributors

Breaking Changes

No breaking changes.

Cookbook Updates

No major cookbook updates.

The Spice Cookbook includes 82 recipes to help you get started with Spice quickly and easily.

Upgrading

To upgrade to v1.10.0, use one of the following methods:

CLI:

spice upgrade

Homebrew:

brew upgrade spiceai/spiceai/spice

Docker:

Pull the spiceai/spiceai:1.10.0 image:

docker pull spiceai/spiceai:1.10.0

For available tags, see DockerHub.

Helm:

helm repo update
helm upgrade spiceai spiceai/spiceai

AWS Marketplace:

🎉 Spice is now available in the AWS Marketplace!

What's Changed

Changelog

Test-operator: Add tpcds_q8 to the default row-count validation skip list by @sgrebnov in #8185
fix: Remove unwrap_used from test by @peasee in #8212
Run glue_iceberg_integration_test_catalog as part of main integration tests by @sgrebnov in #8222
Add TPCH sf100 testoperator spicepods with dispatch by @Jeadie in #8192
Build with CPU native flags by @lukekim in #8224
fix: Apply assertion clippy in CI/Makefile only by @peasee in #8229
feat: Support running queries only in testoperator by @peasee in #8211
DuckDB query planning: aggregate pushdown by @mach-kernel in #8174
install.sh improvements by @lukekim in #8252
Fix .clear history by @lukekim in #8254
Improve the output of dataset loading by @lukekim in #8256
Refactor view validation by @lukekim in #8258
Upgrade AWS crates by @lukekim in #8259
fix: Pushdown dynamic filters to partition scans by @peasee in #8240
Harden SQL identifier quoting in connectors by @phillipleblanc in #8276
Cayenne sort_columns on insert by @lukekim in #8091
Redact token debug output by @phillipleblanc in #8280
fix: Cayenne configuration options by @lukekim in #8281
Prevent path traversal in untar by @phillipleblanc in #8284
Fix cluster mode executor registration by @mach-kernel in #8292
Unignore s3_vectors_kafka_stream test by @Jeadie in #8289
Post-release house keeping by @krinart in #8293
Improve generate_changelog script by @krinart in #8273
Acceleration mode caching by @lukekim in #8237
Sanitization and security checks by @lukekim in #7854
Add health probe latency metric by @phillipleblanc in #8300
Add distributed registration for data accelerators by @phillipleblanc in #8299
Pass IndexedTableProvider down in 'changes_stream' and 'append_stream' by @Jeadie in #8295
Add dynamodb-streams crate by @krinart in #8283
Distributed query: resolve executor hostname when determining Flight server binding by @mach-kernel in #8304
Return computed embeddings from index for partitioned S3Vectors by @Jeadie in #8306
[DDB Streams] Skeleton for DynamoDB Streams by @krinart in #8296
DistributeFileScanOptimizer: Improve planning performance by @mach-kernel in #8305
feat: Add an ExactLeftAccumulator implementation by @peasee in #8302
deps: Upgrade Vortex to 0.56 by @peasee in #8311
DynamoDB table bootstrapping + streaming by @krinart in #8312
Avoid calling S3Vector list_vectors (or equivalent) when indexing into VectorIndexs by @Jeadie in #8282
Add on_conflict testing support to append benchmark by @sgrebnov in #8314
docker: Add valid home directory to fix duckdb extension loading issue by @phillipleblanc in #8318
Add GH Workflow to run Append benchmark test by @sgrebnov in #8321
Exclude MySQL SF100 from test-operator dispatch by @sgrebnov in #8320
feat: Update clippy lints by @peasee in #8317
Add S3 location predicate pruning to listing connector by @phillipleblanc in #8319
Review feedback for caching mode accelerator by @phillipleblanc in #8326
Also include Dockerfile home changes for release build by @phillipleblanc in #8327
Change communication channel from Discord to Slack by @Jeadie in #8330
Replace Discord link with Slack link in README by @Jeadie in #8331
fix(glue): Prevent OpenDAL from automatic loading of AWS credentials from environment by @sgrebnov in #8337
Block on index read for FTS queries by @Jeadie in #8339
Fix search query provider by @Jeadie in #8343
Support for writing into AWS S3 Tables by @sgrebnov in #8344
Acceleration file_create mode by @lukekim in #8347
Don't block on lock in FTS query path by @Jeadie in #8348
feat: Add an optimizer rule to replace join accumulator for Cayenne by @peasee in #8316
S3 Vectors limit updates by @lukekim in #8352
Sanitize top_n_sample order_by parsing by @phillipleblanc in #8356
Add distributed registration for data connectors by @phillipleblanc in #8354
Improve IndexerExec to properly handle limit pushdown by @sgrebnov in #8366
Fix Cayenne partition_by metadata flaky integration test by @phillipleblanc in #8367
Rework caching accelerator to use the stale-while-revalidate pattern. by @phillipleblanc in #8365
Add TinyLFU caching policy by @lukekim in #8370
Make Arrow acceleration on_conflict verification more robust by @sgrebnov in #8375
Add additional test for verify_on_conflict_matches_primary_key (Arrow acceleration) by @sgrebnov in #8376
Add v1.10.0-rc1 release notes by @mach-kernel in #8373
docs: Remove DuckDB agg pushdown from release notes by @peasee in #8383
Testoperator dispatch: add Append support and test configurations by @sgrebnov in #8360
fix: Increase TPCDS DuckDB connection pool size by @peasee in #8386
fix: Update benchmark snapshots by @app/github-actions in #8385
fix: Update benchmark snapshots by @app/github-actions in #8389
Fix Windows build by @phillipleblanc in #8391
DuckDB aggregate pushdown: fix partitioning and schema rewrite bugs by @mach-kernel in #8397
Delta table: Store current snapshot ref with table instance by @mach-kernel in #8358
GetAppDefinition: Check if executor is part of cluster by @mach-kernel in #8396
1.10.0-rc1 housekeeping by @mach-kernel in #8394
Change debug log to warning for vector engine config by @Jeadie in #8378
Clarify /v1/nsql datasets sampling hint by @phillipleblanc in #8395
Use bmi1 target feature for x86_64 by @phillipleblanc in #8401
benchmarks: Default to update snapshots when run on a non-release branch by @phillipleblanc in #8402
Update threat model for v1.9.2 by @phillipleblanc in #8400
Fix iceberg tables metadata - assign ids to all fields, including nested by @ewgenius in #8351
Fix databricks_spark_connect_m2m_integration_test_catalog snapshot by @ewgenius in #8403
Move all GitHub Actions workflows to use exact commit sha by @phillipleblanc in #8409
Upgrade datafusion-tableproviders (df v50) by @lukekim in #8261
Batching for CDC by @krinart in #8359
fix: make DuckDB attachments logic more robust by @sgrebnov in #8411
Persistent checkpoints for DynamoDB Streams by @krinart in #8345
Distributed query: Support AsyncFuncExec and Spice UDFs in Ballista by @mach-kernel in #8414
Pin GitHub Actions to fix Testoperator build action by @sgrebnov in #8416
Watermarks support for DynamoDB by @krinart in #8417
Fix typo in .vscode/launch.json by @sgrebnov in #8415
DuckSqlExec: Update equivalence properties when rewriting schema by @phillipleblanc in #8420
Validate that the commit for datafusion-table-providers exists on the spiceai branch by @phillipleblanc in #8421
Append Tests: add support for retention testing by @sgrebnov in #8419
New crate google-genai by @Jeadie in #8390
federation: Improve error message and add debug logging for cast failures by @phillipleblanc in #8422
DynamoDB Streams Error Handling by @krinart in #8418
Append tests: add support for with_retention_data to dispatch by @sgrebnov in #8430
Append tests: add support for test metrics reporting by @sgrebnov in #8432
test-operator: fix metrics reporting by @sgrebnov in #8435
Follow-up improvements and bug fixes by @krinart in #8433
Periodic snapshots for append/changes streams by @krinart in #8407
Add support for caching_stale_if_error to caching accelerator; fix multiple upstream requests during SWR; fix Arrow accelerator by @phillipleblanc in #8425
Disable dataset health monitor for dynamic HTTP connector by @phillipleblanc in #8441
Metrics + snapshots_trigger_threshold for DynamoDB Streams by @krinart in #8437
Clear the in-flight revalidations cache after a revalidation has completed. by @phillipleblanc in #8443
dont build 'spicepod-validator' on 'make install' by @Jeadie in #8426
fix: Update benchmark snapshots by @app/github-actions in #8436
fix: Disable Cayenne HashJoin rewriter optimizer by @peasee in #8439
Testoperator: add duckdb-partitioned query override by @sgrebnov in #8446
Add a check to validate that results cache SWR and caching accelerator SWR are not both set. by @phillipleblanc in #8445
OTel exporter for push metrics by @lukekim in #8442
fix: Update benchmark snapshots by @app/github-actions in #8448
Add snapshot creation logging by @krinart in #8469
Fix PeriodicReader panic by @krinart in #8471
fix: Pin CUDA build actions to commits by @peasee in #8477
DuckDB agg pushdown: gate behind accelerator parameter by @mach-kernel in #8474
Rename aggregate_pushdown_optimization -> optimizer_duckdb_aggregate_pushdown by @ewgenius in #8485
Handle throttling exception for DynamoDB streams by @phillipleblanc in #8492

Spice v1.10.0-rc.1 (Dec 2, 2025)

December 3, 2025 · 11 min read

David Stancu

Principal Software Engineer at Spice AI

Announcing the release of Spice v1.10.0-rc.1! ⚡

v1.10.0-rc1 is a release candidate for early testing of v1.10 features including an all new caching acceleration mode, tiny_lfu caching policy, a new DynamoDB Streams connector (Preview), improvements to the DynamoDB connector, faster distributed query execution, S3 connector improvements, and security hardening for v1.10.0-stable.

What's New in v1.10.0-rc1

Caching Acceleration Mode with SWR and TinyLFU

This release introduces a new caching acceleration mode that implements the stale-while-revalidate (SWR) pattern using Data Accelerators such as DuckDB or Cayenne, enabling queries to return file-persisted cached results immediately while asynchronously refreshing data in the background. Combined with the new TinyLFU cache eviction policy, Spice can now maintain higher cache hit rates while keeping memory usage predictable.

Key Features:

Stale-While-Revalidate (SWR): Returns cached data immediately while refreshing in the background
Data Accelerator Support: Cached accelerators can persist data to disk using DuckDB, SQLite, or Cayenne file modes.
TinyLFU Cache Policy: Probabilistic cache admission policy that maintains high hit rates with minimal overhead
Predictable Memory Usage: Configurable memory limits with automatic eviction of less frequently used entries

Example Spicepod.yml configuration:

runtime:
  caching:
    sql_results:
      enabled: true
      eviction_policy: tiny_lfu # default lru

datasets:
  - from: s3://my-bucket/data.parquet
    name: cached_data
    acceleration:
      enabled: true
      engine: duckdb
      mode: file # Persist cache to disk
      refresh_mode: caching
      refresh_check_interval: 10m

For more details, refer to the Data Acceleration Documentation and Caching Documentation.

DynamoDB Streams Data Connector in Preview

DynamoDB Connector now integrates with DynamoDB Streams which enables real-time streaming with support for both table bootstrapping and continuous change data capture (CDC). This connector automatically detects changes in DynamoDB tables and streams them into Spice for real-time query, search, and LLM-inference.

Key Features:

Real-Time CDC: Automatically captures inserts, updates, and deletes from DynamoDB tables
Table Bootstrapping: Initial full table load before streaming changes

Example Spicepod.yml configuration:

datasets:
  - from: dynamodb:my_table
    name: orders_stream
    acceleration:
      enabled: true
      refresh_mode: changes

For more details, refer to the DynamoDB Connector Documentation.

Cayenne Accelerator Enhancements

The Cayenne data accelerator now supports:

Sort Columns Configuration: Optimize inserts by pre-sorting data on specified columns for improved query performance

Example Spicepod.yml configuration:

datasets:
  - from: s3://my-bucket/data.parquet
    name: sorted_data
    acceleration:
      enabled: true
      engine: cayenne
      mode: file_create
      params:
        sort_columns: timestamp,region

For more details, refer to the Cayenne Documentation.

S3 Connector Improvements

S3 Location Predicate Pruning: The S3 data connector now supports location-based predicate pruning, dramatically reducing data scanned by pushing down predicates to S3 listing operations. This optimization is especially effective for partitioned datasets stored in S3.

AWS S3 Tables Write Support: Full read/write capability for AWS S3 Tables, enabling fast integration with AWS's table format for S3.

For more details, refer to the S3 Tables Data Connector Documentation and Glue Data Connection Documentation.

Faster Distributed Query Execution

Distributed query planning and execution have been significantly improved:

Fixed executor registration in cluster mode for more reliable distributed deployments
Improved hostname resolution for Flight server binding, enabling better executor discovery
Distributed accelerator registration: Data accelerators now properly register in distributed mode
Optimized query planning: DistributeFileScanOptimizer improvements for faster planning with large datasets

For more details, refer to the Distributed Query Documentation.

Search Improvements

Search capabilities have been improved with several performance and reliability enhancements:

Fixed FTS query blocking: Full-text search queries no longer block unnecessarily, improving query responsiveness
Optimized vector index operations: Eliminated unnecessary list_vectors calls for better performance
Improved limit pushdown: IndexerExec now properly handles limit pushdown for more efficient searches

For more details, refer to the Search Documentation.

Security Hardening

Multiple security improvements have been implemented:

SQL identifier quoting: Hardened SQL identifier quoting across all connectors to prevent injection attacks
Token redaction: Sensitive tokens are now fully redacted in debug output to prevent credential leakage
Path traversal prevention: Fixed tar extraction to prevent path traversal vulnerabilities
Input sanitization: Added validation for top_n_sample order_by parsing
Improved credential handling: Improved credential management in Glue connector

Developer Experience Improvements

Health probe metrics: Added health probe latency metrics for better observability
CLI improvements: Fixed .clear history command in the REPL to fully clear persisted history

Contributors

Breaking Changes

No breaking changes.

Cookbook Updates

No major cookbook updates. The Spice Cookbook still offers 82+ recipes to help you prototype quickly.

Upgrading

To try v1.10.0-rc1, use one of the following methods:

CLI:

spice upgrade --version 1.10.0-rc1

Homebrew:

brew upgrade spiceai/spiceai/spice

Docker:

Pull the spiceai/spiceai:1.10.0-rc1 image:

docker pull spiceai/spiceai:1.10.0-rc1

For available tags, see DockerHub.

Helm:

helm repo update
helm upgrade spiceai spiceai/spiceai --version 1.10.0-rc1

AWS Marketplace:

🎉 Spice is available in the AWS Marketplace.

What's Changed

Changelog

Test-operator: Add tpcds_q8 to the default row-count validation skip list by @sgrebnov in #8185
fix: Remove unwrap_used from test by @peasee in #8212
Run glue_iceberg_integration_test_catalog as part of main integration tests by @sgrebnov in #8222
Add TPCH sf100 testoperator spicepods with dispatch by @Jeadie in #8192
Build with CPU native flags by @lukekim in #8224
Make copilot check for empty copyright header by @krinart in #8245
fix: Apply assertion clippy in CI/Makefile only by @peasee in #8229
feat: Support running queries only in testoperator by @peasee in #8211
DuckDB query planning: aggregate pushdown by @mach-kernel in #8174
install.sh improvements by @lukekim in #8252
Fix .clear history by @lukekim in #8254
fix: Pushdown dynamic filters to partition scans by @peasee in #8240
Harden SQL identifier quoting in connectors by @phillipleblanc in #8276
Cayenne sort_columns on insert by @lukekim in #8091
Redact token debug output by @phillipleblanc in #8280
fix: Cayenne configuration options by @lukekim in #8281
Prevent path traversal in untar by @phillipleblanc in #8284
Fix cluster mode executor registration by @mach-kernel in #8292
Unignore s3_vectors_kafka_stream test by @Jeadie in #8289
Post-release house keeping by @krinart in #8293
Improve generate_changelog script by @krinart in #8273
Acceleration mode caching by @lukekim in #8237
Sanitization and security checks by @lukekim in #7854
Add health probe latency metric by @phillipleblanc in #8300
Add distributed registration for data accelerators by @phillipleblanc in #8299
Pass IndexedTableProvider down in 'changes_stream' and 'append_stream' by @Jeadie in #8295
Add dynamodb-streams crate by @krinart in #8283
Distributed query: resolve executor hostname when determining Flight server binding by @mach-kernel in #8304
Return computed embeddings from index for partitioned S3Vectors by @Jeadie in #8306
[DDB Streams] Skeleton for DynamoDB Streams by @krinart in #8296
DistributeFileScanOptimizer: Improve planning performance by @mach-kernel in #8305
feat: Add an ExactLeftAccumulator implementation by @peasee in #8302
deps: Upgrade Vortex to 0.56 by @peasee in #8311
DynamoDB table bootstrapping + streaming by @krinart in #8312
Avoid calling S3Vector list_vectors (or equivalent) when indexing into VectorIndexs by @Jeadie in #8282
Add on_conflict testing support to append benchmark by @sgrebnov in #8314
docker: Add valid home directory to fix duckdb extension loading issue by @phillipleblanc in #8318
Add GH Workflow to run Append benchmark test by @sgrebnov in #8321
Exclude MySQL SF100 from test-operator dispatch by @sgrebnov in #8320
feat: Update clippy lints by @peasee in #8317
Add S3 location predicate pruning to listing connector by @phillipleblanc
Review feedback for caching mode accelerator by @phillipleblanc in #8326
Also include Dockerfile home changes for release build by @phillipleblanc in #8327
Change communication channel from Discord to Slack by @Jeadie in #8330
Replace Discord link with Slack link in README by @Jeadie in #8331
fix(glue): Prevent OpenDAL from automatic loading of AWS credentials from environment by @sgrebnov in #8337
Block on index read for FTS queries by @Jeadie in #8339
Fix search query provider by @Jeadie in #8343
Support for writing into AWS S3 Tables by @sgrebnov in #8344
Acceleration file_create mode by @lukekim in #8347
Don't block on lock in FTS query path by @Jeadie in #8348
feat: Add an optimizer rule to replace join accumulator for Cayenne by @peasee in #8316
S3 Vectors limit updates by @lukekim in #8352
Sanitize top_n_sample order_by parsing by @phillipleblanc in #8356
Update version to v1.10.0-rc.1 by @ewgenius in #8362
Improve IndexerExec to properly handle limit pushdown by @sgrebnov in #8366
Fix Cayenne partition_by metadata flaky integration test by @phillipleblanc in #8367
Rework caching accelerator to use the stale-while-revalidate pattern. by @phillipleblanc in #8365
Add TinyLFU caching policy by @lukekim in #8370

Spice v1.1.1 (Apr 7, 2025)

April 8, 2025 · 6 min read

Phillip LeBlanc

Co-Founder and CTO of Spice AI

Announcing the release of Spice v1.1.1! 📊

Spice v1.1.1 introduces several key updates, including a new Component Metrics System, improved Delta Data Connector performance, improved MCP tool descriptions, and expanded runtime results caching options. This release also adds detailed MySQL connection pool metrics for better observability. Component Metrics are Prometheus-compatible and accessible via the metrics endpoint.

Highlights v1.1.1

Component Metrics System: A new system for monitoring components, starting with MySQL connection pool metrics. These metrics provide insights into MySQL connection performance and can be selectively enabled in the dataset configuration. Metrics are exposed in Prometheus format via the metrics endpoint.

For more details, see the Component Metrics documentation.

Results Caching Enhancements: Added a cache_key_type option for runtime results caching. Options include:
- plan (Default): Uses the query's logical plan as the cache key. Matches semantically equivalent queries but requires query parsing.
- sql: Uses the raw SQL string as the cache key. Provides faster lookups but requires exact string matches. Use sql for predictable queries without dynamic functions like NOW().

Example spicepod.yaml configuration:

runtime:
  results_cache:
    enabled: true
    cache_max_size: 128MiB
    cache_key_type: sql # Use SQL for the results cache key
    item_ttl: 1s

For more details, see the runtime configuration documentation.

Delta Data Connector: Improved scan performance for faster query performance.
MCP Tools: Improved descriptions for built-in MCP tools to improve usability.
MySQL Component Metrics: Added detailed metrics for monitoring MySQL connections, such as connection count and pool activity.

Example spicepod.yaml configuration:

datasets:
  - from: mysql:my_table
    name: my_dataset
    metrics:
      - name: connection_count
        enabled: true
      - name: connections_in_pool
        enabled: true
      - name: active_wait_requests
        enabled: true
    params:
      mysql_host: localhost
      mysql_tcp_port: 3306
      mysql_user: root
      mysql_pass: ${secrets:MYSQL_PASS}

For more details, see the MySQL Data Connector documentation.

spice.js SDK: The spice.js SDK has been updated to v2.0.1 and includes several important security updates.

New Contributors 🎉

@kczimm made their first contribution in #5243

Contributors

Breaking Changes

No breaking changes in this release.

Cookbook Updates

The Spice Cookbook now includes 65 recipes to help you get started with Spice quickly and easily.

Upgrading

To upgrade to v1.1.1, use one of the following methods:

CLI:

spice upgrade

Homebrew:

brew upgrade spiceai/spiceai/spice

Docker:

Pull the spiceai/spiceai:1.1.1 image:

docker pull spiceai/spiceai:1.1.1

For available tags, see DockerHub.

Helm:

helm repo update
helm upgrade spiceai spiceai/spiceai

What's Changed

Dependencies

No major dependency changes.

Changelog

- fix: Testoperator DuckDB, SQLite, Postgres, Spicecloud by [@peasee](https://github.com/peasee) in [#5190](https://github.com/spiceai/spiceai/pull/5190)
- Update Helm Chart and SECURITY.md to v1.1.0 by [@lukekim](https://github.com/lukekim) in [#5223](https://github.com/spiceai/spiceai/pull/5223)
- Update version.txt to v1.1.1-unstable by [@lukekim](https://github.com/lukekim) in [#5224](https://github.com/spiceai/spiceai/pull/5224)
- Update Cargo.lock to v1.1.1-unstable by [@lukekim](https://github.com/lukekim) in [#5225](https://github.com/spiceai/spiceai/pull/5225)
- Add tests for `verify_schema_source_path` in `ListingTableConnector` by [@phillipleblanc](https://github.com/phillipleblanc) in [#5221](https://github.com/spiceai/spiceai/pull/5221)
- Reduce noise from debug logging by [@phillipleblanc](https://github.com/phillipleblanc) in [#5227](https://github.com/spiceai/spiceai/pull/5227)
- Improve `openai_test_chat_messages` integration test reliability by [@Sevenannn](https://github.com/Sevenannn) in [#5222](https://github.com/spiceai/spiceai/pull/5222)
- Verify the checkpoints existence before shutting down runtime in integration tests directly querying checkpoint by [@Sevenannn](https://github.com/Sevenannn) in [#5232](https://github.com/spiceai/spiceai/pull/5232)
- Fix CORS support for json content-type api by [@sgrebnov](https://github.com/sgrebnov) in [#5241](https://github.com/spiceai/spiceai/pull/5241)
- Fix ModelGradedScorer error: The 'metadata' parameter is only allowed when 'store' is enabled. by [@sgrebnov](https://github.com/sgrebnov) in [#5231](https://github.com/spiceai/spiceai/pull/5231)
- fix: Use `pulls-with-spice-action` and switch to `spiceai-macos` runners by [@peasee](https://github.com/peasee) in [#5238](https://github.com/spiceai/spiceai/pull/5238)
- Use v1.0.3 pulls with spice action by [@lukekim](https://github.com/lukekim) in [#5244](https://github.com/spiceai/spiceai/pull/5244)
- feat: Build ODBC binaries, run testoperator on ODBC by [@peasee](https://github.com/peasee) in [#5237](https://github.com/spiceai/spiceai/pull/5237)
- Bump timeout for several integration test runtime load_components & readiness check by [@Sevenannn](https://github.com/Sevenannn) in [#5229](https://github.com/spiceai/spiceai/pull/5229)
- Validate port is available before binding port for docker container in integration tests by [@Sevenannn](https://github.com/Sevenannn) in [#5248](https://github.com/spiceai/spiceai/pull/5248)
- Update datafusion-table-providers to fix the schema for PostgreSQL materialized views by [@ewgenius](https://github.com/ewgenius) in [#5259](https://github.com/spiceai/spiceai/pull/5259)
- Verify flight server is ready for flight integration tests by [@Sevenannn](https://github.com/Sevenannn) in [#5240](https://github.com/spiceai/spiceai/pull/5240)
- fix: Publish to MinIO inside of matrix on build_and_release by [@peasee](https://github.com/peasee) in [#5258](https://github.com/spiceai/spiceai/pull/5258)
- fix: TPCDS on zero results benchmarks by [@peasee](https://github.com/peasee) in [#5263](https://github.com/spiceai/spiceai/pull/5263)
- Use model as a judge scorer for Financebench by [@sgrebnov](https://github.com/sgrebnov) in [#5264](https://github.com/spiceai/spiceai/pull/5264)
- Fix FinanceBench llm scorer secret name by [@sgrebnov](https://github.com/sgrebnov) in [#5276](https://github.com/spiceai/spiceai/pull/5276)
- Implements support for `runtime.results_cache.cache_key_type` by [@phillipleblanc](https://github.com/phillipleblanc) in [#5265](https://github.com/spiceai/spiceai/pull/5265)
- fix: Testoperator MS SQL, query overrides, dispatcher by [@peasee](https://github.com/peasee) in [#5279](https://github.com/spiceai/spiceai/pull/5279)
- refactor: Delete old benchmarks by [@peasee](https://github.com/peasee) in [#5283](https://github.com/spiceai/spiceai/pull/5283)
- Imporve embedding column parsing performance test by [@Sevenannn](https://github.com/Sevenannn) in [#5268](https://github.com/spiceai/spiceai/pull/5268)
- Add Support for AWS Session Token in S3 Data Connector by [@kczimm](https://github.com/kczimm) in [#5243](https://github.com/spiceai/spiceai/pull/5243)
- Implement Component Metrics system + MySQL connection pool metrics by [@phillipleblanc](https://github.com/phillipleblanc) in [#5290](https://github.com/spiceai/spiceai/pull/5290)
- Add default descriptions to built-in MCP tools by [@lukekim](https://github.com/lukekim) in [#5293](https://github.com/spiceai/spiceai/pull/5293)
- fix: Vector search with cased columns by [@peasee](https://github.com/peasee) in [#5295](https://github.com/spiceai/spiceai/pull/5295)
- Run delta kernel scan in a blocking Tokio thread. by [@phillipleblanc](https://github.com/phillipleblanc) in [#5296](https://github.com/spiceai/spiceai/pull/5296)
- Expose the `mysql_pool_min` and `mysql_pool_max` connection pool parameters by [@phillipleblanc](https://github.com/phillipleblanc) in [#5297](https://github.com/spiceai/spiceai/pull/5297)
- use patched pdf-extract by [@kczimm](https://github.com/kczimm) in [#5270](https://github.com/spiceai/spiceai/pull/5270)

Full Changelog: v1.1.0...v1.1.1

What's New in v1.10.2​

Tiered Caching with Localpod​

Periodic Acceleration Snapshots​

DynamoDB JSON Nesting​

Kafka/Debezium Batching​

Additional Improvements & Bug Fixes​

Contributors​

Breaking Changes​

Cookbook Updates​

Upgrading​

What's Changed​

Changelog​

What's New in v1.10.0​

Caching Acceleration Mode​

TinyLFU Cache Eviction Policy​

DynamoDB Streams Data Connector (Preview)​

OpenTelemetry Metrics Exporter​

S3 Connector Improvements​

Faster Distributed Query Execution​

Search Improvements​

Security Hardening​

Developer Experience Improvements​

Contributors​

Breaking Changes​

Cookbook Updates​

Upgrading​

What's Changed​

Changelog​

What's New in v1.10.0-rc1​

Caching Acceleration Mode with SWR and TinyLFU​

DynamoDB Streams Data Connector in Preview​

Cayenne Accelerator Enhancements​

S3 Connector Improvements​

Faster Distributed Query Execution​

Search Improvements​

Security Hardening​

Developer Experience Improvements​

Contributors​

Breaking Changes​

Cookbook Updates​

Upgrading​

What's Changed​

Changelog​

Highlights v1.1.1​

New Contributors 🎉​

Contributors​

Breaking Changes​

Cookbook Updates​

Upgrading​

What's Changed​

Dependencies​

Changelog​

What's New in v1.10.2

Tiered Caching with Localpod

Periodic Acceleration Snapshots

DynamoDB JSON Nesting

Kafka/Debezium Batching

Additional Improvements & Bug Fixes

Contributors

Breaking Changes

Cookbook Updates

Upgrading

What's Changed

Changelog

What's New in v1.10.0

Caching Acceleration Mode

TinyLFU Cache Eviction Policy

DynamoDB Streams Data Connector (Preview)

OpenTelemetry Metrics Exporter

S3 Connector Improvements

Faster Distributed Query Execution

Search Improvements

Security Hardening

Developer Experience Improvements

Contributors

Breaking Changes

Cookbook Updates

Upgrading

What's Changed

Changelog

What's New in v1.10.0-rc1

Caching Acceleration Mode with SWR and TinyLFU

DynamoDB Streams Data Connector in Preview

Cayenne Accelerator Enhancements

S3 Connector Improvements

Faster Distributed Query Execution

Search Improvements

Security Hardening

Developer Experience Improvements

Contributors

Breaking Changes

Cookbook Updates

Upgrading

What's Changed

Changelog

Highlights v1.1.1

New Contributors 🎉

Contributors

Breaking Changes

Cookbook Updates

Upgrading

What's Changed

Dependencies

Changelog