Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

In this method, rows are hashed into nb number of buckets such that the total rows within a bucket is num of buckets are bounded. A spark DAG is used to partition rows and to sort them within each partition. Actual read and write operations happen within the executors.

...