Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

        Image Removed

 

  Image Added


MADlib graduated to an Apache Top Level Project on 7/19/17.  Read the press release. 

Apache MADlib® is an open-source library for scalable in-database analytics.

It provides data-parallel implementations of mathematical, statistical,

graph and machine learning methods for structured and unstructured data.

Quick Start Guides

...

General Information

Learn about MADlib.

General Information

Developer Documentation

master build statusImage Removed

Contribute to the project.Image Added

Architecture

See how the pieces fit together. 

Release Notes

See what has been released.

Third Party Components

MADlib incorporates material from the following third-party components:

  1. argparse 1.2.1 provides an easy, declarative interface for creating command line tools
  2. Boost 1.47.0 (or newer) provides peer-reviewed portable C++ source libraries
  3. Eigen 3.2.2 is a C++ template library for linear algebra
  4. PyYAML 3.10 is a YAML parser and emitter for Python
  5. PyXB 1.2.4 is a Python library for XML Schema Bindings
  6. Porter2 stemmer reduces workds to common roots for comparison and operating on.
  7. UseLATEX.cmake contains CMAKE commands to use the LaTeX compiler

Licensing

License information regarding MADlib and included third-party libraries can be found inside the license directory.  ASF licensing guidance for MADlib pertaining to its pre-Apache history as an open source project with BSD licensing is described here.

Papers

Related Software

  • PivotalR - lets the user run the functions of the open-source big-data machine learning package MADlib directly from R.

  • PyMADlib  - a nascent Python wrapper for MADlib, which brings you the power and flexibility of python with the number crunching power of MADlib.

 


 

 

 

 

 

 










 Image Added