Home

Awesome

Binary Classification with Apache Spark / HDFS

<img src="img/logo.png" style="width: 5px;"/> ↖data source

The goal of the competition is to predict which parts will fail quality control

My goal is to utilize the hadoop ecosystem to handle a large dataset and establish a pipeline for machine learning

munge :

fit_predict :

munge_fit_predict :