Inference Performance

The performance data in this section was collected on AWS EC2 instances, using a single socket.

  • Performance boost with Intel MKL-DNN backend in release 1.3

...

  • Performance gain from operator fusion by subgraph

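Latency is measured at batch size 1 (ms, lower is better). Throughput is measured at batch size 128 (fps, higher is better).

| Category | Model | Latency: R1.3 w/ MKL-DNN | Latency: master w/ subgraph | Latency: speedup | Throughput: R1.3 w/ MKL-DNN | Throughput: master w/ subgraph | Throughput: speedup |
|---|---|---|---|---|---|---|---|
| CNN/classification | ResNet-50 v1 | | | | | | |
| | ResNet-50 v2 | | | | | | |
| | Inception v3 | | | | | | |
| | Inception v4 | | | | | | |
| | DenseNet | | | | | | |
| | MobileNet | | | | | | |
| | VGG16 | | | | | | |
| | AlexNet | | | | | | |
| | Inception-ResNet v2 | | | | | | |
| CNN/object detection | Faster R-CNN | | | | | | |
| | SSD-VGG16 | | | | | | |
| | SSD-MobileNet | | | | | | |
| RNN | GNMT | | | | | | |
| GAN | DCGAN | | | | | | |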

...
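The latency, throughput, and speedup columns in the table above can be illustrated with a small timing sketch. This is only a rough illustration under stated assumptions: the page does not show the benchmark scripts or say how speedup was computed, so `measure`, `run_batch`, `latency_speedup`, and `throughput_speedup` are hypothetical names, and the speedup definitions (baseline latency divided by new latency, new throughput divided by baseline throughput) are the conventional ones rather than something confirmed by the source.

```python
import time


def measure(run_batch, batch_size, iterations=20, warmup=5):
    """Time a callable that processes one batch of `batch_size` samples.

    `run_batch` is a hypothetical stand-in for one forward pass of a model.
    Returns (latency_ms_per_batch, throughput_samples_per_second).
    """
    # Warm-up runs are excluded from timing (caches, lazy initialization).
    for _ in range(warmup):
        run_batch()

    start = time.perf_counter()
    for _ in range(iterations):
        run_batch()
    elapsed = time.perf_counter() - start

    latency_ms = elapsed / iterations * 1000.0
    throughput_fps = batch_size * iterations / elapsed
    return latency_ms, throughput_fps


def latency_speedup(baseline_ms, new_ms):
    # Assumed convention: lower latency is better, so baseline / new.
    return baseline_ms / new_ms


def throughput_speedup(baseline_fps, new_fps):
    # Assumed convention: higher throughput is better, so new / baseline.
    return new_fps / baseline_fps
```

With these definitions, a speedup above 1 means the newer build is faster in both columns. In a real run, `run_batch` would wrap the model's forward pass and block until the output is ready, since asynchronous execution would otherwise make the timings too optimistic.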