Page History

...

We will create a test accelerator library that simply reuses the existing CPU and GPU operator implementations an run all existing unit tests.

Implementation plan

Implement a PR with basic symbolic flow: analyzeGraph, loadModel, infer
Implement a followup PR with imperative accelerator flow (fcompute, storage, copy, etc)

Alternative Approaches

Currently, custom accelerators like TensorRT must be implemented by modifying the MXNet backend and learning how MXNet works at the lowest level. The team that implemented TensorRT support in MXNet ran through many hurdles and the learnings from that effort are being applied in this proposal.

...

Page tree

Versions Compared

Old Version 7

New Version 8

Key

Implementation plan

Alternative Approaches