Awesome
Sail
The mission of Sail is to unify stream processing, batch processing, and compute-intensive (AI) workloads. Currently, Sail features a drop-in replacement for Spark SQL and the Spark DataFrame API in single-host settings.
Installation
Sail is available as a Python package on PyPI. You can install it using pip
.
# Quick install
pip install pysail
# Install from source for best performance
# rustup (https://rustup.rs/) and protoc are required
env RUSTFLAGS="-C target-cpu=native" pip install pysail -v --no-binary pysail
You can follow the Getting Started guide to learn more about Sail.
Documentation
The documentation of the latest Sail version can be found here.
Contributing
Contributions are more than welcome!
Please submit GitHub issues for bug reports and feature requests.
Feel free to create a pull request if you would like to make a code change. You can refer to the development guide to get started.
Sail vs. Spark Benchmark
Check out our blog post, Supercharge Spark: Quadruple Speed, Cut Costs by 94%, for detailed benchmark results.
Support
See the Support Options Page for more information.