Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: debug link to Bucketed Sorted Tables in DDL doc

This is a brief example on creating and populating bucketed tables. (For another example, see
Bucketed Sorted Tables – Hive: no spaces
Bucketed Sorted Tables – Hive: with spaces
Bucketed Sorted Tables – no Hive:, with spaces
Bucketed Sorted Tables.) – no Hive:, no spaces

Bucketed tables are fantastic in that they allow much more efficient sampling than do non-bucketed tables, and they may later allow for time saving operations such as mapside joins. However, the bucketing specified at table creation is not enforced when the table is written to, and so it is possible for the table's metadata to advertise properties which are not upheld by the table's actual layout. This should obviously be avoided. Here's how to do it right.

...