You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 10 Next »


Inference Performance

This group of the performance test is gathered on AWS EC2 instances in 1 socket.

  • Performance boost with Intel MKL-DNN backend in release 1.3

    • w/o MKL-DNN, pip install mxnet==1.3.0

    • w/ MKL-DNN, pip install mxnet-mkl==1.3.0

Category

Model

Latency batchsize=1 (ms, small is better)Throughput batchsize=128 (fps, big is better)
w/o MKL-DNNw/ MKL-DNNspeedupw/o MKL-DNNw/ MKL-DNNspeedup

CNN/classification

ResNet-50 v1





ResNet-50 v2





Inception v3





Inception v4





DenseNet





MobileNet





VGG16





AlexNet





inception-resnet v2





CNN/object detection

Faster R-CNN





SSD-VGG16





SSD-MobileNet





RNN

GNMT





GANDCGAN






  • Performance gain from operator fusion by subgraph

Category

Model

Latency batchsize=1 (ms, small is better)Throughput batchsize=128 (fps, big is better)
R1.3 w/ MKL-DNNmaster w/ subgraphspeedupR1.3 w/ MKL-DNNmaster w/ subgraphspeedup

CNN/classification

ResNet-50 v1





ResNet-50 v2





Inception v3





Inception v4





DenseNet





MobileNet





VGG16





AlexNet





inception-resnet v2





CNN/object detection

Faster R-CNN





SSD-VGG16





SSD-MobileNet





RNN

GNMT





GANDCGAN




Inference Accuracy


Inference Accuracy Comparison
AliasNetwork# ParametersGPU (with cuDNN) BackendCPU (without MKL-DNN)CPU (with MKL-DNN) Backend
 top1top5 top1 top5 top1 top5
alexnetAlexNet61,100,840      
densenet121DenseNet-1218,062,504      
densenet161DenseNet-16128,900,936      
densenet169DenseNet-16914,307,880      
densenet201DenseNet-20120,242,984      
inceptionv3Inception V3 299x29923,869,000      
mobilenet0.25MobileNet 0.25475,544      
mobilenet0.5MobileNet 0.51,342,536      
mobilenet0.75MobileNet 0.752,601,976      
mobilenet1.0MobileNet 1.04,253,864      
mobilenetv2_1.0MobileNetV2 1.03,539,136      
mobilenetv2_0.75MobileNetV2 0.752,653,864      
mobilenetv2_0.5MobileNetV2 0.51,983,104      
mobilenetv2_0.25MobileNetV2 0.251,526,856      
resnet18_v1ResNet-18 V111,699,112      
resnet34_v1ResNet-34 V121,814,696      
resnet50_v1ResNet-50 V125,629,032      
resnet101_v1ResNet-101 V144,695,144      
resnet152_v1ResNet-152 V160,404,072      
resnet18_v2ResNet-18 V211,695,796      
resnet34_v2ResNet-34 V221,811,380      
resnet50_v2ResNet-50 V225,595,060      
resnet101_v2ResNet-101 V244,639,412      
resnet152_v2ResNet-152 V260,329,140      
squeezenet1.0SqueezeNet 1.01,248,424      
squeezenet1.1SqueezeNet 1.11,235,496      
vgg11VGG-11132,863,336      
vgg13VGG-13133,047,848      
vgg16VGG-16138,357,544      
vgg19VGG-19143,667,240      
vgg11_bnVGG-11 with batch normalization132,874,344      
vgg13_bnVGG-13 with batch normalization133,059,624      
vgg16_bnVGG-16 with batch normalization138,374,440      
vgg19_bnVGG-19 with batch normalization143,689,256      
  • No labels