Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

work numserver numPer Node FPS(pic/s)Scaling Efficiency
88(worker and server share node)19.8767.81%
8827.393.17%
8422.777.47%
8211.1137.90%

Command line: python tools/launch.py -n 8 -s <server_num> --launcher ssh -H hosts python example/image-classification/train_vgg16.py --kv-store dist_sync


Following is the result of MXNet multinode with mpi allreduce supported from our proof of concept (ready):

Node NumPer Node FPS(pic/s)Scaling Efficiency
827.7694.74%

Command line: mpirun -n 8 -ppn 1 -machinefile hosts python example/image-classification/train_vgg16.py --kv-store dist_sync_mpi

MPI Allreduce's good scalability comes from the tremendous communication time decrease compared with parameter server if not allocated enough server num.

...