THIS IS A TEST INSTANCE. ALL YOUR CHANGES WILL BE LOST!!!!

Apache Kylin : Analytical Data Warehouse for Big Data

Page tree

Welcome to Kylin Wiki.



 Spark Job Option


PropertyRequiredPriorityDatatypeDefaultDescriptionVersionReference
kylin.engine.spark.build-class-name
nolowString
org.apache.kylin.engine.spark.job.CubeBuildJob
For developer only. The className use in spark-submit.4.0+
kylin.engine.spark.cluster-info-fetcher-class-name















kylin.storage.provider
no
String
不同的云厂商返回的 ContentSummary 对象不尽相同, 需要针对性地提供实现
请参考 org.apache.kylin.common.storage.IStorageProvider


kylin.engine.spark.merge-class-name
no
String
org.apache.kylin.engine.spark.job.CubeMergeJob



kylin.engine.spark.task-impact-instance-enabled
no
Boolean
Check kylin.engine.spark.task-core-factorAffect spark.executor.instances for Build Engine.

kylin.engine.spark.task-core-factor
no
Integer


kylin.engine.driver-memory-base
no
Integer
Affect spark.driver.memory for Build Engine.



kylin.engine.driver-memory-strategy
no




kylin.engine.driver-memory-maximum
no
Integer


kylin.engine.persist-flattable-threshold
no




kylin.snapshot.parallel-build-timeout-seconds
no



如果希望提升快照的构建速度的话, 可以设置这个


kylin.snapshot.parallel-build-enabled
no
Boolean










kylin.spark-conf.auto.prior
no
Boolean
是否需要自动设置一些 SparkConf

kylin.engine.submit-hadoop-conf-dir




Set HADOOP_CONF_DIR for spark-submit.


kylin.storage.columnar.shard-size-mb






和 Shard 相关的一系列配置, 我暂时还不懂


ylin.storage.columnar.shard-rowcount






kylin.storage.columnar.shard-countdistinct-rowcount






kylin.query.spark-engine.join-memory-fraction




限制 广播Join使用的内存, 这个名字是不是有问题, 为啥是 query 开头

  • No labels