Map-Join Partition Pruning Optimization

Overview

In Hive, Map-Join is a technique that materializes the data for every table in the join except the largest one, and then streams the large table over the materialized data from the small tables. Map-Join is often a good approach for star-schema joins, where the fact table is streamed over the materialized dimension tables.
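
The idea can be illustrated with a minimal in-memory sketch. This is not Hive's implementation; it only shows the general hash-join pattern of materializing a small table and streaming the large one over it. The function names (`build_hash_table`, `map_join`) and the row-as-dict representation are assumptions made for illustration.

```python
from collections import defaultdict


def build_hash_table(rows, key):
    """Materialize a small table in memory, indexed by its join key."""
    table = defaultdict(list)
    for row in rows:
        table[row[key]].append(row)
    return table


def map_join(fact_rows, fact_key, dim_rows, dim_key):
    """Inner-join a streamed large table against a materialized small table.

    Hypothetical helper: fact_rows may be any iterable (for example a
    generator reading from disk); only dim_rows is held in memory.
    """
    dim_table = build_hash_table(dim_rows, dim_key)
    for fact in fact_rows:
        # Probe the in-memory table; fact rows without a match are dropped.
        for dim in dim_table.get(fact[fact_key], ()):
            yield {**dim, **fact}
```

In a star-schema query, `fact_rows` would correspond to the fact table and `dim_rows` to one dimension table. Joining several dimensions would mean building one hash table per dimension and probing each in turn.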

...