You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 9 Next »


Inference Performance

This group of the performance test is gathered on AWS EC2 instances in 1 socket.

  • Performance boost with Intel MKL-DNN backend in release 1.3

    • w/o MKL-DNN, pip install mxnet==1.3.0

    • w/ MKL-DNN, pip install mxnet-mkl==1.3.0

Category

Model

Latency batchsize=1 (ms, small is better)Throughput batchsize=128 (fps, big is better)
w/o MKL-DNNw/ MKL-DNNspeedupw/o MKL-DNNw/ MKL-DNNspeedup

CNN/classification

ResNet-50 v1





ResNet-50 v2





Inception v3





Inception v4





DenseNet





MobileNet





VGG16





AlexNet





inception-resnet v2





CNN/object detection

Faster R-CNN





SSD-VGG16





SSD-MobileNet





RNN

GNMT





GANDCGAN






  • Performance gain from operator fusion by subgraph

Category

Model

Latency batchsize=1 (ms, small is better)Throughput batchsize=128 (fps, big is better)
R1.3 w/ MKL-DNNmaster w/ subgraphspeedupR1.3 w/ MKL-DNNmaster w/ subgraphspeedup

CNN/classification

ResNet-50 v1





ResNet-50 v2





Inception v3





Inception v4





DenseNet





MobileNet





VGG16





AlexNet





inception-resnet v2





CNN/object detection

Faster R-CNN





SSD-VGG16





SSD-MobileNet





RNN

GNMT





GANDCGAN




Inference Accuracy

Inference Accuracy Comparison
AliasNetwork# ParametersOfficial ResultsMKL-DNN Backend
Top-1 AccuracyTop-5 Accuracy  
alexnetAlexNet61,100,8400.54920.7803  
densenet121DenseNet-1218,062,5040.74970.9225  
densenet161DenseNet-16128,900,9360.7770.938  
densenet169DenseNet-16914,307,8800.76170.9317  
densenet201DenseNet-20120,242,9840.77320.9362  
inceptionv3Inception V3 299x29923,869,0000.77550.9364  
mobilenet0.25MobileNet 0.25475,5440.51850.7608  
mobilenet0.5MobileNet 0.51,342,5360.63070.8475  
mobilenet0.75MobileNet 0.752,601,9760.67380.8782  
mobilenet1.0MobileNet 1.04,253,8640.71050.9006  
mobilenetv2_1.0MobileNetV2 1.03,539,1360.71920.9056  
mobilenetv2_0.75MobileNetV2 0.752,653,8640.69610.8895  
mobilenetv2_0.5MobileNetV2 0.51,983,1040.64490.8547  
mobilenetv2_0.25MobileNetV2 0.251,526,8560.50740.7456  
resnet18_v1ResNet-18 V111,699,1120.70930.8992  
resnet34_v1ResNet-34 V121,814,6960.74370.9187  
resnet50_v1ResNet-50 V125,629,0320.76470.9313  
resnet101_v1ResNet-101 V144,695,1440.78340.9401  
resnet152_v1ResNet-152 V160,404,0720.790.9438  
resnet18_v2ResNet-18 V211,695,7960.710.8992  
resnet34_v2ResNet-34 V221,811,3800.7440.9208  
resnet50_v2ResNet-50 V225,595,0600.77110.9343  
resnet101_v2ResNet-101 V244,639,4120.78530.9417  
resnet152_v2ResNet-152 V260,329,1400.79210.9431  
squeezenet1.0SqueezeNet 1.01,248,4240.56110.7909  
squeezenet1.1SqueezeNet 1.11,235,4960.54960.7817  
vgg11VGG-11132,863,3360.66620.8734  
vgg13VGG-13133,047,8480.67740.8811  
vgg16VGG-16138,357,5440.73230.9132  
vgg19VGG-19143,667,2400.74110.9135  
vgg11_bnVGG-11 with batch normalization132,874,3440.68590.8872  
vgg13_bnVGG-13 with batch normalization133,059,6240.68840.8882  
vgg16_bnVGG-16 with batch normalization138,374,4400.7310.9176  
vgg19_bnVGG-19 with batch normalization143,689,2560.74330.9185  
  • No labels