Basic configuration

Property	Default	Desc	Since
kylin.snapshot.parallel-build-enabled
kylin.snapshot.parallel-build-timeout-seconds
kylin.snapshot.shard-size-mb
kylin.storage.columnar.shard-size-mb
kylin.storage.columnar.shard-rowcount
kylin.storage.columnar.shard-countdistinct-rowcount
kylin.storage.columnar.repartition-threshold-size-mb
kylin.engine.submit-hadoop-conf-dir

Advanced configuration

Property	Default	Since
kylin.engine.spark.cache-parent-dataset-storage-level	NONE	4.0.0
kylin.engine.spark.cache-parent-dataset-count	1	4.0.0
kylin.engine.build-base-cuboid-enabled	true	4.0.0

Spark resources automatic adjustment strategy

Property	Default	Desc	Since
kylin.spark-conf.auto.prior
kylin.engine.driver-memory-base
kylin.engine.driver-memory-maximum
kylin.engine.driver-memory-strategy

Global dictionary

Data skew

Property

Default

Description

Version

kylin.engine.spark.build-class-name

org.apache.kylin.engine.spark.job.CubeBuildJob

For developers only. The class name used in spark-submit for build jobs.

4.0.0-alpha

kylin.engine.spark.cluster-info-fetcher-class-name

org.apache.kylin.cluster.YarnInfoFetcher

For developers only. Fetches YARN information for Spark jobs.

4.0.0-alpha

kylin.engine.spark-conf.XXX

Before Kylin submits a cubing job, some major Spark properties (cores and memory) are adjusted automatically if kylin.spark-conf.auto.prior is set to true.
After the automatic adjustment, the Spark configuration is overwritten by this property. For example, to set spark.driver.extraJavaOptions=-Dhdp.version=current, add the following line to kylin.properties:

kylin.engine.spark-conf.spark.driver.extraJavaOptions=-Dhdp.version=current

4.0.0-alpha
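The override order described above can be sketched as follows. This is only an illustration, not Kylin's implementation: the helper names (parse_properties, effective_spark_conf) and the auto-adjusted values are hypothetical, and the sketch assumes that a kylin.engine.spark-conf.* entry simply replaces the auto-adjusted value of the same Spark key.

```python
PREFIX = "kylin.engine.spark-conf."


def parse_properties(text):
    """Parse simple key=value lines, skipping blank lines and # comments."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the first '=' only, so values may themselves contain '='.
        key, sep, value = line.partition("=")
        if sep:
            props[key.strip()] = value.strip()
    return props


def effective_spark_conf(auto_adjusted, kylin_props):
    """Overlay kylin.engine.spark-conf.* entries on the auto-adjusted values."""
    conf = dict(auto_adjusted)
    for key, value in kylin_props.items():
        if key.startswith(PREFIX):
            conf[key[len(PREFIX):]] = value
    return conf


kylin_properties = """
kylin.spark-conf.auto.prior=true
kylin.engine.spark-conf.spark.driver.extraJavaOptions=-Dhdp.version=current
kylin.engine.spark-conf.spark.executor.memory=4g
"""

# Hypothetical result of the automatic adjustment (values made up for illustration).
auto = {"spark.executor.memory": "2g", "spark.executor.cores": "2"}

conf = effective_spark_conf(auto, parse_properties(kylin_properties))
for name in sorted(conf):
    print(name, "=", conf[name])
```

In this sketch the explicit spark.executor.memory entry wins over the adjusted value, while spark.executor.cores keeps the adjusted value because no explicit entry exists for it.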

kylin.storage.provider

org.apache.kylin.common.storage.DefaultStorageProvider

The content summary objects returned by different cloud vendors are not the same, so a vendor-specific implementation must be provided.

To learn more, refer to org.apache.kylin.common.storage.IStorageProvider.

4.0.0-alpha

kylin.engine.spark.merge-class-name

org.apache.kylin.engine.spark.job.CubeMergeJob

For developers only. The class name used in spark-submit for merge jobs.

4.0.0-alpha

kylin.engine.spark.task-impact-instance-enabled

true

Updating

4.0.0-alpha

kylin.engine.spark.task-core-factor

3

Updating

4.0.0-alpha

kylin.engine.driver-memory-base

1024

Automatically adjusts spark.driver.memory for the build engine if kylin.engine.spark-conf.spark.driver.memory is not set.

4.0.0-alpha

kylin.engine.driver-memory-strategy

{"2", "20", "100"}

Updating

4.0.0-alpha

kylin.engine.driver-memory-maximum

4096

Updating

4.0.0-alpha

kylin.engine.persist-flattable-threshold

1

If the number of cuboids to be built from the flat table exceeds this threshold, the flat table is persisted to $HDFS_WORKING_DIR/job_tmp/flat_table to save memory.

4.0.0-alpha
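A minimal sketch of the threshold rule for kylin.engine.persist-flattable-threshold, assuming a strict "greater than" comparison as the description states. The function names and the working directory value are hypothetical and only serve the illustration.

```python
def should_persist_flat_table(cuboids_from_flat_table, threshold=1):
    """Return True when the cuboid count exceeds the configured threshold."""
    return cuboids_from_flat_table > threshold


def flat_table_path(hdfs_working_dir):
    """Build the location named in the description: <working dir>/job_tmp/flat_table."""
    return hdfs_working_dir.rstrip("/") + "/job_tmp/flat_table"


# Hypothetical working directory, used only for this example.
working_dir = "hdfs:///kylin/working_dir/"

for count in (1, 2, 5):
    if should_persist_flat_table(count):
        print(count, "cuboids: persist to", flat_table_path(working_dir))
    else:
        print(count, "cuboids: keep the flat table in memory")
```

With the default of 1, a build that derives a single cuboid from the flat table does not persist it, while two or more cuboids do.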

kylin.snapshot.parallel-build-timeout-seconds

3600

Timeout, in seconds, for the parallel snapshot build, which is used to improve the speed of snapshot builds.

4.0.0-alpha

kylin.snapshot.parallel-build-enabled

true

Updating

kylin.spark-conf.auto.prior

true

Enables adaptive adjustment of Spark parameters.

4.0.0-alpha

kylin.engine.submit-hadoop-conf-dir

/etc/hadoop/conf

Set HADOOP_CONF_DIR for spark-submit.

4.0.0-alpha

kylin.storage.columnar.shard-size-mb

128

The maximum size, in MB, of a pre-calculated cuboid Parquet file.

4.0.0-alpha

kylin.storage.columnar.shard-rowcount

2500000

The maximum number of rows in a pre-calculated cuboid Parquet file.

4.0.0-alpha

kylin.storage.columnar.shard-countdistinct-rowcount

1000000

The maximum number of rows in a pre-calculated cuboid Parquet file when the cuboid has a bitmap measure. (A cuboid with a bitmap measure is large.)

4.0.0-alpha
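To see how the three shard limits might interact, the following sketch estimates how many Parquet files a cuboid would need. It assumes the size limit and the row limit act as independent upper bounds and that the stricter one wins. The source does not describe how Kylin combines them, so the function estimate_shard_count is purely illustrative.

```python
import math


def estimate_shard_count(total_size_mb, total_rows, has_bitmap_measure=False,
                         shard_size_mb=128,
                         shard_rowcount=2_500_000,
                         shard_countdistinct_rowcount=1_000_000):
    """Estimate the file count so that no file exceeds the size or row limit."""
    # Cuboids with a bitmap measure use the smaller row limit.
    row_limit = shard_countdistinct_rowcount if has_bitmap_measure else shard_rowcount
    by_size = math.ceil(total_size_mb / shard_size_mb)
    by_rows = math.ceil(total_rows / row_limit)
    # Assumption: the stricter limit decides, and at least one file is written.
    return max(1, by_size, by_rows)


plain = estimate_shard_count(1000, 10_000_000)
bitmap = estimate_shard_count(1000, 10_000_000, has_bitmap_measure=True)
print(plain, bitmap)
```

For a 1000 MB cuboid with 10 million rows, the size limit dominates in the plain case, and the lower row limit dominates once a bitmap measure is present.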

kylin.query.spark-engine.join-memory-fraction

0.3

Limits the memory used by broadcast joins in Sparder. (Broadcast joins can cause instability.)

4.0.0-alpha

...
