Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

For the throughput, 2 sockets can provide about 2X speedup while latency will keep the constant.

Performance boost on Intel CPU with Intel MKL-DNN backend in release 1.3

The c5.18xlarge instance offers a 2-socket Intel Xeon Platinum processor with 72 vCPUs.

...

CategoryModelLatency batchsize=1 (ms, small is better)Throughput batchsize=128 (fps, big is better)
w/o MKL-DNNw/ MKL-DNNspeedupw/o MKL-DNNw/ MKL-DNNspeedup
CNN/classificationResNet-50 v197.1913.047.4510.29163.5215.90
ResNet-50 v298.6913.027.589.94154.1715.51
Inception v3175.1716.7710.445.74135.3323.57
Inception v4330.9331.4010.543.0469.6022.87
DenseNet111.6618.905.918.52149.8817.60
MobileNet38.564.428.7324.87512.2520.60
VGG16406.5020.0720.252.9170.8424.31
AlexNet64.603.8017.0026.58965.2036.32
inception-resnet v2181.1049.403.675.4882.9715.14
CNN/object detectionFaster R-CNN1175.74118.629.910.858.5710.08
SSD-VGG16721.0347.6215.141.43(batchsize=224)28.90(batchsize=224)19.13
SSD-MobileNet239.4028.338.454.07(batchsize=256)69.97(batchsize=256)14.18
RNNGNMT683.4394.007.271.46(batchsize=64)10.63(batchsize=64)6.83
GANDCGAN8.940.2437.85109.134249.3638.94

Inference Accuracy

The c5.18xlarge instance offers a 2-socket Intel Xeon Platinum processor with 72 vCPUs.

The model is from gluon model zoo by pre-trained parameters. The top1 and top5 accuracy are verified by MKL-DNN backend. 

...