Apache Kylin : Analytical Data Warehouse for Big Data
Background
Starting with Kylin 4.0.0, Kylin provides two binary packages (one built for Spark 2, one for Spark 3), each verified on different Hadoop environments. We chose several popular Hadoop distributions: Cloudera CDH, HDP, and AWS EMR.
In addition, we include one custom Hadoop installation combination, which may be helpful for users who prefer to assemble their own Hadoop stack.
Kylin 4.0.0 Support Matrix
Kylin Binary | Hadoop Distribution | Spark | Hadoop | Hive | Cluster Manager | Distributed Storage | Verified? | Comment |
---|---|---|---|---|---|---|---|---|
Kylin 4.0.0-spark2 | CDH 5.7 | 2.4.7 | 2.6.0-cdh5.7.6 | 1.1.0-cdh5.7.6 | YARN | HDFS | | |
Kylin 4.0.0-spark2 | HDP 2.4 | 2.4.7 | 2.7.1.2.4.0.0-16 | 1.2.1000.2.4.0.0-16 | YARN | HDFS | | |
Kylin 4.0.0-spark2 | AWS EMR 5.33.0 | 2.4.7 | 2.10.1-amzn-1 | 2.3.7-amzn-4 | YARN | HDFS/S3 | | |
Kylin 4.0.0-spark2 | CDH 6.2.0 | 2.4.7 | 3.0.0-cdh6.2.0 | 2.1.1-cdh6.2.0 | YARN | HDFS | | |
Kylin 4.0.0-spark3 | AWS EMR 6.3.0 | 3.1.1 | 3.2.1-amzn-3 | 3.1.2-amzn-4 | YARN | HDFS/S3 | | |
Kylin 4.0.0-spark3 | CDH 6.2.0 | 3.1.1 | 3.0.0-cdh6.2.0 | 2.1.1-cdh6.2.0 | YARN | HDFS | | |
Kylin 4.0.0-spark3 | Apache | 3.1.1 | 3.2.0 | 2.3.9 | YARN, Standalone | S3 | | http://kylin.apache.org/docs40/install/deploy_without_hadoop.html |
Note:
- Object storage such as S3 is not well tested and is tagged as an experimental feature; its performance is not as good as that of HDFS. It is therefore not recommended in production environments without a storage cache layer (such as Alluxio).
- When using Standalone as the cluster manager, Kylin 4.0.0 supports only client as the deployMode.
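
As a rough illustration of how the matrix and notes above might be consulted programmatically, here is a minimal Python sketch. The names `Entry`, `MATRIX`, `binaries_for`, and `standalone_deploy_mode_ok` are hypothetical and not part of Kylin; the data is copied from the Kylin 4.0.0 table. An empty result only means the distribution is not listed in the matrix, not that it is known to be incompatible.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Entry:
    """One row of the Kylin 4.0.0 support matrix (hypothetical helper type)."""
    binary: str
    distribution: str
    spark: str
    hadoop: str
    hive: str
    cluster_managers: Tuple[str, ...]
    storage: Tuple[str, ...]


# Rows copied from the Kylin 4.0.0 support matrix above.
MATRIX = (
    Entry("Kylin 4.0.0-spark2", "CDH 5.7", "2.4.7", "2.6.0-cdh5.7.6",
          "1.1.0-cdh5.7.6", ("YARN",), ("HDFS",)),
    Entry("Kylin 4.0.0-spark2", "HDP 2.4", "2.4.7", "2.7.1.2.4.0.0-16",
          "1.2.1000.2.4.0.0-16", ("YARN",), ("HDFS",)),
    Entry("Kylin 4.0.0-spark2", "AWS EMR 5.33.0", "2.4.7", "2.10.1-amzn-1",
          "2.3.7-amzn-4", ("YARN",), ("HDFS", "S3")),
    Entry("Kylin 4.0.0-spark2", "CDH 6.2.0", "2.4.7", "3.0.0-cdh6.2.0",
          "2.1.1-cdh6.2.0", ("YARN",), ("HDFS",)),
    Entry("Kylin 4.0.0-spark3", "AWS EMR 6.3.0", "3.1.1", "3.2.1-amzn-3",
          "3.1.2-amzn-4", ("YARN",), ("HDFS", "S3")),
    Entry("Kylin 4.0.0-spark3", "CDH 6.2.0", "3.1.1", "3.0.0-cdh6.2.0",
          "2.1.1-cdh6.2.0", ("YARN",), ("HDFS",)),
    Entry("Kylin 4.0.0-spark3", "Apache", "3.1.1", "3.2.0",
          "2.3.9", ("YARN", "Standalone"), ("S3",)),
)


def binaries_for(distribution: str) -> List[str]:
    """Return the Kylin binaries listed for a distribution (case-insensitive)."""
    wanted = distribution.strip().lower()
    return [e.binary for e in MATRIX if e.distribution.lower() == wanted]


def standalone_deploy_mode_ok(deploy_mode: str) -> bool:
    """Per the note above, Standalone supports only the client deployMode."""
    return deploy_mode.strip().lower() == "client"


if __name__ == "__main__":
    print(binaries_for("CDH 6.2.0"))
    print(standalone_deploy_mode_ok("cluster"))
```

For example, `binaries_for("CDH 6.2.0")` returns both the spark2 and spark3 binaries, since CDH 6.2.0 appears in the matrix with each of them.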
Kylin 4.0.1 Support Matrix
Not released yet.