Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Note: It may be confusing as to the difference between SORT BY alone of a single column and CLUSTER BY. The difference is that CLUSTER BY partitions by the field and SORT BY if there are multiple reducers SORT BY partitions randomly in order to distribute data (and load) uniformly across the reducers.

...