Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Candidate Some potential roadmap for remainder of 2016:ideas for 2018+ below.  For more details please see the JIRAs

Predictive models

...

  • Multi-class SVM 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1037
  • Mixed effects modeling 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-987
  • Gradient boosted machines
  • Gaussian Mixture Model using Expectation Maximization (EM)

...

  • algorithm 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-410
  • Hierarchical clustering
  • k-NN improvements 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1061
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1181
  • Deep learning

Graph

  • Personalized PageRank 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1084
  • Betweenness centrality 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1121
  • Graph cut 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1074
  • Triangle counting 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1125
  • Minimum spanning tree
  • Eigenvector centrality 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1123
  • APSP performance improvements 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1155

Utilities

  • Balanced datasets 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1168

    Summary - add more statistics 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1167
  • Anonymization  
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-911
  • URI tools  
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-910

Graph

Shortest path  MADLIB-992
Standard traversal
  depth first search
  breadth first search
  topological sort
One mode projection (converting a bi-partitite graph of user-item graph to user-user or item-item graph)
Connected components
Page rank
Hierarchical graph cut
Between-ness centrality
Minimum spanning tree

Utilities

...

Usability

...

  • Interface improvement and consistency

...

  • for 2.0 release (does not need to be backward compatible) 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1157
  • Python API

Performance and scalability

Work around PostgreSQL 1 GB field size limit  MADLIB-991

Platform

  • Mini-batching 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1048
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1037
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1200
  • Improve decision tree and random forest performance for run-time and memory usage 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1057
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-976

Platforms and Frameworks

  • PostgreSQL 10 support 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1185
  • Support modern versions of gcc 
    Jira
    serverASF JIRA
    serverId5aa69414-a9e9-3523-82ec-879b028fb15b
    keyMADLIB-1025
  • Tensorflow support, or another deep learning framework
  • GPU support

...