Home

Awesome

<h1 align="center"> <img src="https://raw.githubusercontent.com/ArroyoSystems/arroyo/760aabdbdb019d95f0c5ebb60933233aa735f830/images/arroyo_logo.png" width="400px" alt="Arroyo" /> </h1> <h4 align="center"> <a href="https://arroyo.dev/">Arroyo Cloud</a> | <a href="https://doc.arroyo.dev/getting-started">Getting started</a> | <a href="https://doc.arroyo.dev">Docs</a> | <a href="https://discord.gg/cjCr5rVmyR">Discord</a> | <a href="https://arroyo.dev">Website</a> </h4> <h4 align="center"> <a href="https://github.com/ArroyoSystems/arroyo/blob/master/LICENSE-APACHE"> <img src="https://img.shields.io/badge/license-MIT%2FApache--2.0-orange" alt="Arroyo is dual-licensed under Apache 2 and MIT licenses." /> </a> <a href="https://github.com/ArroyoSystems/arroyo/blob/master/CONTRIBUTING.md"> <img src="https://img.shields.io/badge/PRs-Welcome-brightgreen" alt="PRs welcome!" /> </a> <a href="https://github.com/ArroyoSystems/arroyo/commits"> <img src="https://img.shields.io/github/commit-activity/m/ArroyoSystems/arroyo" alt="git commit activity" /> </a> <img alt="CI" src="https://github.com/ArroyoSystems/arroyo/actions/workflows/ci.yml/badge.svg"> <a href="https://github.com/ArroyoSystems/arroyo/releases"> <img alt="GitHub release (latest by date)" src="https://img.shields.io/github/v/release/ArroyoSystems/arroyo?display_name=release"> </a> </h4>

Arroyo is a distributed stream processing engine written in Rust, designed to efficiently perform stateful computations on streams of data. Unlike traditional batch processing, streaming engines can operate on both bounded and unbounded sources, emitting results as soon as they are available.

In short: Arroyo lets you ask complex questions of high-volume real-time data with subsecond results.

running job

Features

πŸ¦€ SQL and Rust pipelines

πŸš€ Scales up to millions of events per second

πŸͺŸ Stateful operations like windows and joins

πŸ”₯State checkpointing for fault-tolerance and recovery of pipelines

πŸ•’ Timely stream processing via the Dataflow model

Use cases

Some example use cases include:

Why Arroyo

There are already a number of existing streaming engines out there, including Apache Flink, Spark Streaming, and Kafka Streams. Why create a new one?

Getting Started

You can get started with a single node Arroyo cluster by running the following docker command:

$ docker run -p 5115:5115 ghcr.io/arroyosystems/arroyo:latest

or you can download a binary for your platform from the releases page.

For example, for MacOS:

$ curl https://github.com/ArroyoSystems/arroyo/releases/download/v0.11.0/arroyo-macos-m1 -Lo arroyo && chmod +x arroyo
$ ./arroyo cluster

Then, load the Web UI at http://localhost:5115.

For a more in-depth guide, see the getting started guide.

Once you have Arroyo running, follow the tutorial to create your first real-time pipeline.

Developing Arroyo

We love contributions from the community! See the developer setup guide to get started, and reach out to the team on discord or create an issue.

Community

Arroyo Cloud

Don't want to self-host? Arroyo Systems provides fully-managed cloud hosting for Arroyo. Sign up here.