Skip to content

Latest commit

 

History

History

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 

README.md

Distributed-TensorFlow-Example

About a cluster of TensorFlow servers, and how to distribute a computation graph across that cluster

Requirements

  • Python 3.5.2
  • TensorFlow >= 1.4.0 (tf.data.FixedLengthRecordDataset)
  • horovod

How to Run

python cifar10_download_and_extract.py

srun -n 4 --mpi=pmi2 --partition=k80 --gres=gpu:4 python cifar10_main.py --data_dir=data/cifar10_data

Reference

horovod/examples/tensorflow_mnist.py

models/official/resnet/cifar10_main.py