Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

We will create a test accelerator library that simply reuses the existing CPU and GPU operator implementations an run all existing unit tests. 

Implementation plan

  1. Implement a PR with basic symbolic flow: analyzeGraph, loadModel, infer
  2. Implement a followup PR with imperative accelerator flow (fcompute, storage, copy, etc)

Alternative Approaches

Currently, custom accelerators like TensorRT must be implemented by modifying the MXNet backend and learning how MXNet works at the lowest level. The team that implemented TensorRT support in MXNet ran through many hurdles and the learnings from that effort are being applied in this proposal. 

...