Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Decreasing this value will reduce the time it takes for compaction to be started for a table or partition that requires compaction.  However, checking if compaction is needed requires several calls to the NameNode for each table or partition that has had a transaction done on it since the last major compaction.  So decreasing this value will increase the load on the NameNode.

hive.compactor.cleaner.run.interval
  • Default Value: 5000
  • Added In: Hive 0.14 with HIVE-8258

Time in milliseconds between runs of the cleaner thread.  Increasing this value will lengthen the time it takes to clean up old, no longer used versions of data and lower the load on the metastore server.  Decreasing this value will shorten the time it takes to clean up old, not longer used version of the data and increase the load on the metastore server.

hive.compactor.delta.num.threshold
  • Default Value: 10
  • Added In: Hive 0.13.0 with HIVE-5843

...