Skip to main content
Version: Next

Dremio Data Connector Deployment Guide

Production operating guide for the Dremio data connector covering authentication, Flight SQL transport, and operational tuning.

Authentication & Secrets​

The Dremio connector connects over Arrow Flight SQL with username/password or personal-access-token (PAT) authentication.

ParameterDescription
dremio_endpointFlight SQL endpoint, e.g. grpc+tls://dremio.internal:32010.
dremio_usernameDremio user (username/password auth).
dremio_passwordDremio password. Use ${secrets:...} from a secret store.
dremio_tokenAlternatively, a PAT or session token.

Use TLS endpoints (grpc+tls://) in production. Credentials must be sourced from a secret store.

Resilience Controls​

Flight SQL Transport​

Data transfer uses gRPC. Transient UNAVAILABLE / DEADLINE_EXCEEDED errors surface to the caller and rely on the Flight client's default retry policy. Per-operation retry parameters are not exposed at the Spice layer.

Query Pushdown​

Dremio is a federated engine; the connector pushes SQL predicates and projections into Dremio where possible. For heavy analytical joins, execute the join in Dremio (via a Dremio view) rather than pulling raw rows into Spice.

Capacity & Sizing​

  • Network: Flight SQL is gRPC over HTTP/2. Co-locate Spice with Dremio for best latency.
  • Coordinator load: Every Spice query opens a Flight ticket against a Dremio coordinator. For dashboard-heavy workloads, accelerate the dataset in Spice (acceleration: enabled) to offload repeat queries from the coordinator.
  • Result streaming: Large result sets are streamed as Arrow record batches; memory footprint scales with the configured DataFusion batch size, not the result-set total.

Metrics​

Flight SQL transport metrics are collected via the shared Flight client instrumentation. The connector does not currently register Dremio-specific dataset-level instruments. Monitor via:

  • Spice query execution metrics (query_duration_ms, query_processed_rows, query_failures_total) from runtime.metrics.
  • Dremio's own job metrics exposed via the Dremio UI / API (job_id correlation).

See Component Metrics for configuration.

Task History​

Dremio queries participate in task history via Flight client spans. Each Flight request is captured as a child of the enclosing sql_query or accelerated_table_refresh task.

Known Limitations​

  • Read-only: The connector is read-only; writes to Dremio are not supported.
  • Temporary tables: Dremio temporary objects are not visible to Spice; use Dremio views for shared logic.
  • Reflection-aware routing: The connector does not explicitly hint Dremio reflections; they are still applied by the coordinator transparently.

Troubleshooting​

SymptomLikely causeResolution
UNAUTHENTICATED on handshakeWrong credentials or expired PAT.Verify credentials; regenerate the PAT via Dremio UI.
UNAVAILABLE intermittent errorsNetwork partition or coordinator restart.Flight client auto-retries; if persistent, check coordinator health.
PERMISSION_DENIED on a specific datasetDremio role lacks SELECT on the underlying source.Grant access in Dremio via the user/role management UI.
Slow queries for repeated dashboardsCoordinator overloaded by repeat queries.Enable Spice acceleration for the dataset to cache results locally.
TLS handshake failuresSelf-signed cert or missing CA.Configure TLS at the Flight client; ensure the CA bundle is trusted by the Spice runtime.