...

  • Efficient predicate pushdown: metadata for all columns can be stored together, and a query can efficiently read only the columns it needs.
  • Parallel processing of splits is natively supported.
  • Can provide UDF support if needed.
    • This may be useful for commonly used geo queries.
    • Example: a table has latitude/longitude columns. We can still query data by hexagon/quad-tree cell efficiently using the data skipping index; plain ranges on latitude and longitude may not be as effective.
  • Better storage compression
  • We can try different layouts by sorting data on different parameters (partition, fileId, columnBeingIndexed, etc.)
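
To make the data skipping idea in the list above concrete, here is a minimal sketch of range-based file pruning over per-file column statistics. It is an illustration only, not the actual index implementation: the `ColumnStats` record, the `files_to_scan` helper, and the sample file ids and values are all hypothetical, and it assumes the index keeps a min/max value per file and column.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ColumnStats:
    """Hypothetical index entry: min/max of one column within one file."""
    file_id: str
    column: str
    min_value: float
    max_value: float


def files_to_scan(index, column, lo, hi):
    """Return the ids of files whose [min, max] range for `column` overlaps [lo, hi].

    Files whose range does not overlap the predicate cannot contain matching
    rows, so they can be skipped without being opened.
    """
    return sorted(
        s.file_id
        for s in index
        if s.column == column and s.max_value >= lo and s.min_value <= hi
    )


# Made-up sample entries, for illustration only.
INDEX = [
    ColumnStats("f1", "latitude", 10.0, 20.0),
    ColumnStats("f2", "latitude", 30.0, 40.0),
    ColumnStats("f3", "latitude", 15.0, 35.0),
    ColumnStats("f1", "longitude", -80.0, -70.0),
]

candidates = files_to_scan(INDEX, "latitude", 12.0, 18.0)
```

Because only the entries for the queried column are consulted, sorting or grouping the stored entries by column (one of the layouts mentioned above) would keep those entries adjacent. A geo query could presumably apply the same overlap test to a derived cell id instead of raw latitude/longitude ranges, but that is an assumption and is not shown here.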

...