Awesome

duckdb-api

Running DuckDB behind a Hono.js API in a Docker container.

Building the Docker image

docker build -t duckdb-api .

Usage

To build and run the Docker container, use the following commands:

docker run -p 3000:3000 tobilg/duckdb-api:0.1.0

Environment Variables

The following environment variables can be set to configure the API:

PORT: The port to run the API on. Defaults to 3000. (optional)
USERNAME: The username to be used for basic auth. (optional)
PASSWORD: The password to be used for basic auth. (optional)
AWS_REGION: The AWS region to be used for S3 access (optional)
AWS_ACCESS_KEY_ID: The AWS access key ID to be used for S3 access (optional)
AWS_SECRET_ACCESS_KEY: The AWS secret access key to be used for S3 access (optional)

If you want to activate basic auth for the API, set both the USERNAME and the PASSWORD environment variables.

In case you want to use AWS S3, set the AWS_REGION, AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY environment variables with the appropriate values (you need to have set up the appropriate credentials beforehand in your AWS account).

API Documentation

There are two endpoints available:

POST /query: Executes a SQL query and returns the result as a JSON object.
POST /streaming-query: Executes a SQL query and streams the Arrow data to the client.

The request body of both endpoints is a JSON object with a query property, which contains the SQL query to execute.

Examples

Simple query to the JSON endpoint

curl -X POST http://localhost:3000/query -H "Content-Type: application/json" -d '{"query": "SELECT 1;"}'

Streaming query from a remote Parquet file

curl --location 'localhost:3000/streaming-query' \
  --header 'Content-Type: application/json' \
  --data '{
    "query": "SELECT * FROM '\''https://shell.duckdb.org/data/tpch/0_01/parquet/orders.parquet'\'' LIMIT 100"
  }'
  --output /tmp/result.arrow