Home

Awesome

#Distributed TensorFlow on Spark First presented at the 2016 Spark Summit East: [Slide deck] (http://www.slideshare.net/arimoinc/distributed-tensorflow-scaling-googles-deep-learning-library-on-spark-58527889), [Presentation video] (https://www.youtube.com/watch?v=-QtcP3yRqyM), [Blog post] (https://arimo.com/machine-learning/deep-learning/2016/arimo-distributed-tensorflow-on-spark/)

##TensorSpark productionalized in yarn-cluster mode This latest version contains modifications/improvements that are mostly relevant to someone interested in taking TensorSpark to production in yarn-cluster mode (tested with a Hortonworks distribution [HDP 2.4] with CPU machines). For other deployment and machine types, the earlier version as of [Commit #62] (https://github.com/adatao/tensorspark/tree/2eae6732709884f08e800efa24653340f2f7997b) might still be a better option.

###Summary of changes since [Commit #62] (https://github.com/adatao/tensorspark/tree/2eae6732709884f08e800efa24653340f2f7997b) There are few minor improvements (see commits for details) and the following 2 major changes:

###To run

  1. zip pyfiles.zip ./parameterwebsocketclient.py ./parameterservermodel.py ./mnistcnn.py ./mnistdnn.py ./moleculardnn.py ./higgsdnn.py
  2. spark-submit
    <br />--master yarn
    <br />--deploy-mode cluster
    <br />--queue default
    <br />--num-executors 3
    <br />--driver-memory 20g
    <br />--executor-memory 60g
    <br />--executor-cores 8
    <br />--py-files ./pyfiles.zip
    <br />./tensorspark.py

Partial project layout: <br>tensorspark/gpu_install.sh - script to build tf from source with gpu support for aws <br>tensorspark/simple_websocket_*.py - simple tornado websocket example <br>tensorspark/parameterservermodel.py - "abstract" model class that has all tensorspark required methods implemented <br>tensorspark/*dnn.py - specific fully connected models for specific datasets <br>tensorspark/mnistcnn.py - convolutional model for mnist <br>tensorspark/parameterwebsocketclient.py - spark worker code <br>tensorspark/tensorspark.py - entry point and spark driver code