Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Added part about automated testing

...

  • Be widely known
  • Be used and accepted (academic citations and concrete use cases can help justify this)
  • Be highly scalable
  • Be well documented
  • Have APIs consistent with other algorithms in MLLib that accomplish the same thing
  • Come with a reasonable expectation of developer support.

Automated Testing

Spark comes with a fairly comprehensive suite for unit tests, functional tests and integration tests. All pull requests are automatically tested on Jenkins, currently hosted by the Berkeley AMPLab. In order to run the same set of tests Jenkins run locally, you can use the scripts in /dev/run-tests.

Starter Tasks

If you are new to Spark and want to contribute, you can browse through the list of starter tasks on our JIRA. These tasks are typically small and simple, and are excellent problems to get you ramped up.

...