...
No. | Item | Description | Link | Open |
---|---|---|---|---|
1 | Improved error message for Elastic Net predict() | When we pass the selected coefficients to elastic net's "predict()" function, it throws as ugly error message which is not indicative of the real error. | https://issues.apache.org/jira/browse/MADLIB-835 | Open |
2 | Confusing Error Messages while running elastic net prediction function | Fix confusing error message | https://issues.apache.org/jira/browse/MADLIB-787 | Open |
3 | LDA (parsed) model table and output table disagree | Investigate and determine if this is an issue. If it is, repair it. | https://issues.apache.org/jira/browse/MADLIB-899 | Open |
4 | PivotalR test failures indicate potential bugs in MADlib GLM | These problems may be just numerical issues with too large the condition numbers or too small of a training set. To be investigated. | https://issues.apache.org/jira/browse/MADLIB-896 | Open |
5 | Implement skipping of arrays-with-NULL for elastic net predict | Better NULL handling for elastic net predict. | https://issues.apache.org/jira/browse/MADLIB-919 | Open |
6 | Improve RF output format for variable importance | Easier way of accessing the variable importance output from random forest so that I can understand which are the most important variables. | https://issues.apache.org/jira/browse/MADLIB-925 | Open |
7 | Covariance matrix | Add parameter to output covariance matrix to Pierson's correlation function. | https://issues.apache.org/jira/browse/MADLIB-924 | Open |
8 |
...
New Features
No. | Item | Description | Link | Open | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | Add PMML export modules* | Support additional MADlib modules for PMML export |
| Open | ||||||||
2 |
*Some notes on PMML below...
...
JPMML is an open source PMML evaluator available under GPL license.
For more information, please see https://github.com/jpmml and https://github.com/jpmml/openscoring
You can only export from MADlib into PMML (no import currently)
New Non-Iterative Modules
New Iterative Modules
PivotalR
PivotalR is a package that enables users of R, the most popular open source statistical programming language and environment, to interact with the Greenplum database, HAWQ and PostgreSQL on large data sets. It does so by providing an interface to the operations on tables/views in the database.
...