Skip to main content

🧑‍🍳 Spice.ai OSS Cookbook

A collection of guides and samples to help you build data-grounded AI apps and agents with Spice.ai Open-Source. Find ready-to-use examples for data acceleration, AI agents, LLM memory, and more.

Contribute to the Cookbook on GitHub!

Featured Recipes

Federated Data Access

Use SQL to join data across S3, PostgreSQL, and Dremio in a single query.

Local Llama3

Use the Llama family of models locally from HuggingFace using Spice.

Data Acceleration

Accelerate queries with DuckDB.

LLM Memory

Persistent memory for language models

Sample Applications and Guides

Explore example applications and guides built with Spice.ai that demonstrate real-world usage patterns and best practices.

Command Query Responsibility Segregation (CQRS)

Sample application implementing the CQRS pattern with Spice.

Core Scenarios

Discover core capabilities like data federation, acceleration, search, and LLM inference to enhance your applications.

Federated SQL Query

Query data from S3, PostgreSQL, and Dremio in a single query.

OpenAI SDK

Use the OpenAI SDK to connect to models hosted on Spice.

DuckDB Data Accelerator

Accelerate data locally using DuckDB.

Models, AI, and Agents

Integrate with popular AI models, LLMs, and build intelligent agents using Spice.ai.

Azure OpenAI Models

Connect and use Azure OpenAI models with Spice.

Running Llama3 Locally

Use the Llama family of models locally from HuggingFace using Spice.

OpenAI SDK

Use the OpenAI SDK to connect to models hosted on Spice.

LLM Memory

Persistent memory for language models.

Text to SQL (NSQL)

Ask natural language (NLP) questions of your datasets using the built-in text-to-SQL tool.

Nvidia NIM

Deploy Nvidia NIM infrastructure, on Kubernetes, with GPUs connected to Spice.

Searching GitHub Files

Search GitHub files with embeddings and vector similarity search.

xAI Models

Use xAI models such as Grok.

Data Acceleration, Materialization, and Federation

Optimize query performance with local acceleration, data materialization, and federation techniques.

DuckDB Data Accelerator

Accelerate data locally using DuckDB.

PostgreSQL Data Accelerator

Accelerate data locally using PostgreSQL.

SQLite Data Accelerator

Accelerate data locally using SQLite.

Apache Arrow Data Accelerator

Accelerate data using Apache Arrow.

Indexes on Accelerated Data

Create and manage indexes on accelerated data.

Search and Embeddings

Implement advanced search capabilities and leverage embeddings for vector similarity search.

Searching GitHub Files

Search GitHub files with embeddings and vector similarity search.

Data Connectors

Connect to various data sources and systems to query, analyze, and manage your data efficiently.

PostgreSQL Connector

Connect to and query PostgreSQL databases.

AWS RDS PostgreSQL

Connect to AWS RDS PostgreSQL instances.

Supabase PostgreSQL

Connect to Supabase PostgreSQL databases.

MySQL Connector

Connect to and query MySQL databases.

AWS RDS Aurora MySQL

Connect to AWS RDS Aurora with MySQL compatibility.

PlanetScale MySQL

Connect to PlanetScale MySQL databases.

Clickhouse Connector

Connect to and query Clickhouse databases.

Databricks Connector

Connect to and query Databricks instances using Delta Lake or Spark Connect.

Debezium CDC from Postgres

Stream changes from PostgreSQL using Debezium CDC.

Debezium CDC with SASL/SCRAM

Stream MySQL changes using Debezium with SASL/SCRAM authentication.

Dremio Connector

Connect to and query Dremio.

DuckDB Connector

Query DuckDB databases with sample TPCH data.

File Connector

Query data from local files.

FTP Connector

Query data from FTP servers.

GitHub Connector

Connect to and query GitHub data.

GraphQL Connector

Query data from GraphQL endpoints.

MSSQL Connector

Connect to Microsoft SQL Server databases.

ODBC Connector

Connect to databases using ODBC.

S3 Connector

Query data from S3 compatible storage.

SharePoint Connector

Connect to SharePoint and OneDrive for Business.

Snowflake Connector

Connect to and query Snowflake databases.

Spice.ai Cloud Connector

Connect to the Spice.ai Cloud Platform.

Apache Spark Connector

Connect to and query Apache Spark.

Catalog Connectors

Connect to data catalogs to discover, manage, and utilize your data assets effectively.

Spice.ai Cloud Platform Catalog

Connect to the Spice.ai Cloud Platform catalog.

Databricks Unity Catalog

Connect to Databricks Unity catalog.

Unity Catalog

Connect to Unity catalog.

Iceberg Catalog

Connect to Iceberg catalog.

Data Visualization with Client Integrations

Visualize your data using popular BI and analytics tools integrated with Spice.ai.

Sales BI with Apache Superset

Visualize data in Spice with Apache Superset.

Grafana Datasource

Add Spice as a Grafana datasource.

Deployment

Deploy Spice.ai in various environments and configurations to suit your needs.

Deploying to Kubernetes

Deploy Spice.ai on Kubernetes.

Running in Docker

Run Spice.ai in Docker containers.

Performance and Benchmarking

Measure and optimize performance with benchmarks and best practices for your Spice.ai deployment.

TPC-H Benchmarking

Benchmark performance using TPC-H.

Results Caching

Cache query results for improved performance.

Indexes on Accelerated Data

Create and manage indexes on accelerated data.

Advanced Configuration

Fine-tune your Spice.ai deployment with advanced configuration options for optimal performance.

Data Retention Policy

Configure data retention policies.

Refresh Data Window

Configure data refresh windows.

Advanced Data Refresh

Advanced configuration for data refresh.

Data Quality with Constraints

Add data quality constraints.

SDKs

Utilize development kits and libraries for building applications with Spice.ai across various programming languages.

OpenAI SDK

Use the OpenAI SDK to connect to models hosted on Spice.

Rust SDK

Query Spice.ai using the Rust SDK.

Python SDK

Query Spice.ai using the Python SDK.

Go SDK

Query Spice.ai using the Go SDK.

JavaScript SDK

Query Spice.ai using the JavaScript SDK.

Java SDK

Query Spice.ai using the Java SDK.

Security

Secure your Spice.ai deployment and data access with robust security practices and configurations.

Intelligent Security Copilot

Analyze real-time data access patterns with Spice.ai.

TLS Encryption

Enable encryption in transit using TLS.

API Key Authentication

Secure access with API key authentication.