Hadron - Hadoop MapReduce in Haskell

Hadron aims to bring Haskell's type-safety to the complex and delicate world of Hadoop Streaming MapReduce.

Features

Shortcomings and Issues

Hadoop has a habit of changing small details, program flags, and behavior across major releases. While we try to make this package as sound as possible, you may need to do some debugging if the Hadoop version you are running behaves differently.

This library has been tested most often on Amazon's EMR offering and Cloudera's local demo VM.

Status

Soostone uses hadron extensively to process datasets with billions of rows. There is room for improvement, but the library is fully functional.

Modules

Hadron.Basic

This module exposes low-level functionality for constructing a single MapReduce step. Not recommended for direct use in most cases.
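For readers new to Hadoop Streaming, the following is a minimal sketch of what a single MapReduce step looks like at the line level, written in plain Haskell. It does not use Hadron.Basic or any Hadron API; the names `mapLine`, `splitKV`, and `reduceLines` are hypothetical and exist only for illustration. It assumes the usual Streaming convention of tab-separated key/value lines, with mapper output arriving at the reducer sorted by key.

```haskell
import Data.Function (on)
import Data.List (groupBy)

-- Illustrative only: these functions are NOT part of Hadron.

-- | Mapper: turn one input line into "key\tvalue" lines, one per word.
mapLine :: String -> [String]
mapLine line = [w ++ "\t1" | w <- words line]

-- | Split a "key\tvalue" line at the first tab.
splitKV :: String -> (String, String)
splitKV s = case break (== '\t') s of
  (k, '\t' : v) -> (k, v)
  (k, _)        -> (k, "")

-- | Reducer: sum the counts of each run of adjacent equal keys.
-- Relies on the input being sorted by key, as Hadoop provides it.
-- Note that 'read' throws at runtime on a malformed count.
reduceLines :: [String] -> [String]
reduceLines ls =
  [ k ++ "\t" ++ show (sum (map (read . snd) grp :: [Int]))
  | grp@((k, _) : _) <- groupBy ((==) `on` fst) (map splitKV ls)
  ]
```

As a streaming executable, the mapper side could be wired to standard input and output with `interact (unlines . concatMap mapLine . lines)`, and the reducer side with `interact (unlines . reduceLines . lines)`.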

Hadron.Controller

High-level module for automated orchestration of multi-stage MapReduce jobs.

(More docs and examples to be added)

Hadron.Protocol

Defines data encode/decode strategies via the Protocol type.
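To make the idea of an encode/decode strategy concrete, here is a small sketch of how such a strategy can be modelled as a value. The `Codec` record and the functions around it are hypothetical stand-ins invented for this example; they are not Hadron's actual `Protocol` definition, which may differ substantially (for instance, in how it handles streams of bytes).

```haskell
import Text.Read (readMaybe)

-- Hypothetical stand-in for illustration; NOT Hadron's Protocol type.
-- A codec pairs an encoder with a decoder that can fail.
data Codec a = Codec
  { encode :: a -> String
  , decode :: String -> Maybe a
  }

-- | Strings pass through unchanged.
textCodec :: Codec String
textCodec = Codec { encode = id, decode = Just }

-- | Ints use their decimal representation.
intCodec :: Codec Int
intCodec = Codec { encode = show, decode = readMaybe }

-- | Combine two codecs into a tab-separated pair codec.
-- Assumes the first component never contains a tab character.
pairCodec :: Codec a -> Codec b -> Codec (a, b)
pairCodec ca cb = Codec
  { encode = \(a, b) -> encode ca a ++ "\t" ++ encode cb b
  , decode = \s -> case break (== '\t') s of
      (l, '\t' : r) -> (,) <$> decode ca l <*> decode cb r
      _             -> Nothing
  }
```

With this shape, a malformed record surfaces as `Nothing` at the decoding boundary instead of as a runtime exception deep inside a job, and the type parameter records which values a given step reads or writes.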

TODO

See TODO.org.

Contributors

Release Notes

Version 0.5