Pyrallel - Parallel Data Analytics in Python

Warning: this project is unmaintained and has no future; use dask and dask-distributed instead.

Overview: experimental project to investigate distributed computation patterns for machine learning and other semi-interactive data analytics tasks.

Scope:

Disclaimer: the public API of this library is unlikely to stabilize soon, as the current goal of this project is to experiment.

Dependencies

The usual suspects: Python 2.7, NumPy, SciPy.

Fetch the development version (master branch) from:

The StarCluster develop branch and its IPCluster plugin are also required to easily start up a set of nodes with IPython.parallel configured.

Patterns currently under investigation

See the content of the examples/ folder for more details.

License

MIT

History

This project started at the PyCon 2012 PyData sprint as a set of proof-of-concept IPython.parallel scripts.