Skip to main content

IMAP Data Connector

The IMAP Data Connector enables federated SQL query across emails stored in an IMAP email server.

datasets:
- from: imap:[email protected]
name: emails
params:
imap_access_token: ${secrets:IMAP_ACCESS_TOKEN}

Schema​

Field NameData TypeNullableDescription
dateDate64NoThe date and time when the email was sent.
subjectUtf8YesThe subject line of the email.
fromList<Utf8>YesThe sender(s) of the email.
toList<Utf8>YesThe primary recipient(s) of the email.
ccList<Utf8>YesThe carbon copy recipient(s) of the email.
bccList<Utf8>YesThe blind carbon copy recipient(s) of the email.
reply_toList<Utf8>YesThe email address(es) to which replies should be sent.
message_idUtf8YesA unique identifier for the email message.
in_reply_toUtf8YesThe message_id of the email this message is replying to, if applicable.
contentUtf8YesThe raw email body of this message. Not retrieved when acceleration is disabled.

If a MIME-encoded value is retrieved for a field, it is not decoded and the MIME-encoded value is returned in SQL queries.

Most fields are optional, and depend on the implementation of the specific IMAP server being connected to. For example, the IMAP RFC specifies the message_id field SHOULD be supplied but the field is an optional field.

For more information, refer to the IMAP RFC 2822 - Section 3.6.

Retrieving email body contents​

When the IMAP Data Connector is used without acceleration, the email body will not be retrieved - only header/subject values. To load the email body contents, specify an acceleration:

datasets:
- from: imap:[email protected]
name: emails
params:
imap_access_token: ${secrets:IMAP_ACCESS_TOKEN}
acceleration:
enabled: true

With an acceleration enabled, the content field will be populated with the complete email body including headers, without any decoding applied. This field could be used for post-processing the email, like retrieving custom header values or decoding MIME-encoded content.

Limitations
  • Email attachments are currently not parsed from the email body into separate dataset fields. To read email attachments, parse the multipart encodings from the content field.

Configuration​

from​

The from field must contain the email address for the mailbox to connect to. For example, [email protected], or [email protected].

name​

The dataset name. This will be used as the table name within Spice.

Example:

datasets:
- from: imap:[email protected]
name: emails
params: ...
SELECT COUNT(*) FROM emails;
+----------+
| count(*) |
+----------+
| 1234 |
+----------+

params​

The IMAP connector supports the following connection and authentication parameters:

Parameter NameDescription
imap_usernameOptional. The username to use for the IMAP connection. Defaults to the value of the from: mailbox field.
imap_access_tokenOptional. The OAuth access token to use for the IMAP connection, to connect to OAuth-enabled IMAP servers (like Outlook, or Gmail).
imap_passwordOptional. The password to use for the IMAP connection, in plaintext authentication mode.
imap_hostOptional. The host or IP address of the IMAP server to connect to. Not required for known connections like Outlook or Gmail.
imap_portOptional. The port of the IMAP server to connect to.
imap_mailboxOptional. The mailbox to read mail from. Defaults to INBOX, the standard email inbox.
imap_ssl_modeOptional. The IMAP SSL mode to use. Defaults to tls, permitted values of tls, starttls, disabled or auto.

Examples​

Basic example​

datasets:
- from: imap:[email protected]
name: emails
params:
imap_host: mail.example.com
imap_password: ${ secrets:IMAP_PASSWORD }

Using OAuth authentication​

datasets:
- from: imap:[email protected]
name: emails
params:
imap_access_token: ${ secrets:IMAP_ACCESS_TOKEN }

Secrets​

Spice integrates with multiple secret stores to help manage sensitive data securely. For detailed information on supported secret stores, refer to the secret stores documentation. Additionally, learn how to use referenced secrets in component parameters by visiting the using referenced secrets guide.

Cookbook​