...
- Apache Mahout - Previously built on Hadoop MapReduce, Mahout has switched to Spark as its backend
- Apache MRQL - A query processing and optimization system for large-scale, distributed data analysis, built on top of Apache Hadoop, Hama, and Spark
- BlinkDB - a massively parallel, approximate query engine built on top of Shark and Spark
- Spindle - Spark/Parquet-based web analytics query engine
- Spark Spatial - Spatial joins and processing for Spark
- Thunderain - a framework for combining stream processing with historical data (think Lambda architecture)
- DF from Ayasdi - a Pandas-like data frame implementation for Spark
- Oryx - Lambda architecture on Apache Spark and Apache Kafka for real-time, large-scale machine learning