Usability

...

Improvements

  • Improved the usability of examples and tutorials
  • C++ stack traces are now demangled on OSX and Linux (improved developer experience and bug reports)

Bug-fixes

  • Fixed I/O multiprocessing for too many open file handles (#8904), race condition (#8995), deadlock (#9126)
  • Fixed image IO integration with OpenCV 3.3 (#8757)
  • Fixed Gluon block printing (#8956)
  • Fixed float16 argmax when there is negative input (#9149)
  • Fixed random number generator to ensure sufficient randomness (#9119, #9256, #9300)
  • Fixed custom op multi-GPU scaling (#9283)
  • Fixed gradient of `gather_nd` when duplicate entries exist in index (#9200)

...

  • )

  • Fixed overridden contexts in Module `group2ctx` option when using multiple contexts (#8867)
  • Fixed `swap_axes` operator with "add_to" gradient req (#9541)
  • Fixed SSD kernel launch resource limits on Volta (#9480)
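
The `gather_nd` gradient fix above concerns indices that repeat. As a minimal sketch of why duplicates matter, the plain-Python code below models a 1-D gather and its backward pass. It is not MXNet's implementation: the helper names `gather` and `gather_backward` are hypothetical, and the sketch assumes the intended gradient is the sum of all contributions that land on the same input position.

```python
def gather(data, index):
    """Forward pass: pick data[i] for every i in index (1-D case only)."""
    return [data[i] for i in index]


def gather_backward(out_grad, index, size):
    """Backward pass: scatter out_grad back to the input positions.

    Accumulates with += so that repeated indices sum their gradients
    instead of overwriting each other (assumed intended behaviour).
    """
    in_grad = [0.0] * size
    for g, i in zip(out_grad, index):
        in_grad[i] += g
    return in_grad


data = [10.0, 20.0, 30.0]
index = [0, 2, 2]  # position 2 is gathered twice

out = gather(data, index)
grad = gather_backward([1.0, 1.0, 1.0], index, len(data))
print(out)   # [10.0, 30.0, 30.0]
print(grad)  # [1.0, 0.0, 2.0]
```

With plain assignment instead of accumulation, position 2 would receive a gradient of 1.0 rather than 2.0, which is the kind of error duplicate indices can cause.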


New Features

  • Added experimental API in `contrib.text` for building vocabulary and loading pre-trained word embeddings, with built-in support for 307 GloVe and FastText pre-trained embeddings (#8763)
  • Added experimental structural blocks in `gluon.contrib`: `Concurrent`, `HybridConcurrent`, `Identity`. (#9427) 
  • Added `sparse.dot(dense, csr)` operator (#8938)
  • Added `Khatri-Rao` operator (#7781)
  • Added `FTML` and `Signum` optimizer (#9220, #9262)
  • Added `ENABLE_CUDA_RTC` build option (#9428)

API Changes

  • Added zero gradients to rounding operators including `rint`, `ceil`, `floor`, `trunc`, and `fix` (#9040)

  • Added `use_global_stats` in `nn.BatchNorm` (#9420)
  • Added `axis` argument to `SequenceLast`, `SequenceMask` and `SequenceReverse` operators (#9306)
  • Added `lazy_update` option for standard `SGD` & `Adam` optimizer with `row_sparse` gradients (#9468, #9189)
  • Added `select` option in `Block.collect_params` to support regex (#9348)
  • Added support for (one-to-one and sequence-to-one) inference on explicit unrolled RNN models in R (#9022) 
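
The `select` option in `Block.collect_params` filters parameters by a regular expression. The sketch below illustrates the idea with the standard `re` module only, so it runs without MXNet installed. The parameter names and the helper `select_params` are hypothetical, and matching from the start of the name (as `re.match` does) is an assumption about how the pattern is applied.

```python
import re


def select_params(param_names, select):
    """Return the names that match the regex `select` from their start."""
    pattern = re.compile(select)
    return [name for name in param_names if pattern.match(name)]


# Hypothetical parameter names, in the style of a two-layer dense network.
names = ["dense0_weight", "dense0_bias", "dense1_weight", "dense1_bias"]

print(select_params(names, ".*weight"))       # ['dense0_weight', 'dense1_weight']
print(select_params(names, "dense0_.*"))      # ['dense0_weight', 'dense0_bias']
print(select_params(names, ".*weight|.*bias"))  # all four names
```

A typical use would be to collect only the weights, for example to apply weight decay to them and not to the biases.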

Deprecations


  • (#9579, #9324)

Performance Improvements

  • Improved GPU inference speed by 20% when batch size is 1 (#9055)
  • Improved `SequenceLast` operator speed (#9306)
  • Added multithreading for the class of broadcast_reduce operators on CPU (#9444)
  • Improved batching for GEMM/TRSM operators with large matrices on GPU (#8846)

Known Issues

  • "Predict with pre-trained models" tutorial is broken
  • "example/numpy-ops/ndarray_softmax.py" is broken


How to build MXNet

Please follow the instructions at https://mxnet.incubator.apache.org/install/index.html

List of submodules used by Apache MXNet (Incubating) and when they were updated last

Submodule :: Last updated by MXNet :: Last update in submodule

  1. cub@ :: Jul 31, 2017 :: Jul 31, 2017
  2. dlpack@ :: May 20, 2017 :: Oct 30, 2017
  3. dmlc-core@ :: Jan 16, 2018 :: Jan 17, 2018
  4. mshadow@ :: Jan 11, 2017 :: Jan 15, 2018
  5. nnvm@ :: Dec 9, 2017 :: Jan 10, 2017
  6. ps-lite@ :: Nov 21, 2017 :: Jan 2, 2017