Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

This guide explains all of the elements needed to successfully develop and plug in a new MADlib® module.

  1. Prerequisites
  2. Docker Image
  3. Adding a New Module
  4. Adding an Iterative UDF

...

MADlib source code is organized such that the core logic of a machine learning or statistical module is located in a common location, and the database-port specific code is located in a ports folder.  Since all currently supported databases are based on Postgres, the postgres port contains all the port-specific files, with greenplum and hawq inheriting from it.  Before proceeding with this guide, it is recommended that you familiarize yourself with the MADlib module anatomy.

Anchor

...

Dock

...

Dock
Docker Image

We provide a Docker image with necessary dependencies required to compile and test MADlib on PostgreSQL 9.6.  You can view the dependency docker file at  ./tool/docker/base/Dockerfile_postgres_9_6. The image is hosted on docker hub at madlib/postgres_9.6:latestLater we will provide a similar Docker image for Greenplum Database.

...