How to run TF Learn (skflow) KMeans clustering in a multi-machine, multi-GPU environment?

Asked by Rahul on 11 April 2017 at 07:23

Is there any example or documentation that shows how to use skflow (tf.contrib.learn) in a multi-machine, multi-GPU environment?

I want to run KMeansClustering using skflow (tf.contrib.learn) in a multi-machine, multi-GPU environment. How should I proceed?

1 answer

Jonathan Hseu on 14 April 2017 at 18:01

Ideally you'd be running a cluster manager (Kubernetes, Mesos); otherwise, setting up multi-machine training is rather difficult.

For a core TensorFlow example, see here (under the docker/kubernetes directories): https://github.com/tensorflow/ecosystem

We'll update the examples there to use both Estimator and multi-GPU setups soon.
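
Until then, here is a rough sketch of the part a cluster manager usually automates: telling each process which machines form the cluster and which task it is. It assumes the `TF_CONFIG` environment variable convention that tf.contrib.learn's run configuration reads. The host names, ports, and the `build_tf_config` helper are hypothetical, and the exact keys expected may differ between TensorFlow versions, so check the `RunConfig` documentation for the version you run.

```python
import json
import os


def build_tf_config(cluster, task_type, task_index):
    """Return a TF_CONFIG JSON string for one task (hypothetical helper)."""
    if task_type not in cluster:
        raise ValueError("unknown job name: %r" % task_type)
    if not 0 <= task_index < len(cluster[task_type]):
        raise ValueError("task index out of range for job %r" % task_type)
    return json.dumps(
        {
            "cluster": cluster,
            "task": {"type": task_type, "index": task_index},
        },
        sort_keys=True,
    )


# Hypothetical host names and ports; replace them with your own machines.
CLUSTER = {
    "ps": ["ps0.example.com:2222"],
    "worker": ["worker0.example.com:2222", "worker1.example.com:2222"],
}

if __name__ == "__main__":
    # Example: run this on the second worker machine before the
    # estimator (and its run configuration) is constructed.
    os.environ["TF_CONFIG"] = build_tf_config(CLUSTER, "worker", 1)
    print(os.environ["TF_CONFIG"])
```

Every process gets the same `cluster` dictionary and differs only in its `task` entry. With Kubernetes or Mesos, this value would typically be injected into each container's environment rather than set by hand.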