...

  • The TPC-DS generator is a private function and must be imported before it can be used.
  • The TPC-DS generator is based on dsdgen, the TPC-DS data generation tool.
  • The data is generated in parallel, using all available partitions on all available nodes.
  • The generated data types are in accordance with the data types in the TPC-DS schema. Each value is converted to its proper type during data generation (e.g. integers, doubles, strings).
  • DATE and TIME types are treated as String.
  • Two versions of the generator function exist:
    • Single-parameter version: takes a single parameter, scalefactor, and generates the data for all tables at the specified scalefactor.
    • Two-parameter version: takes two parameters, tablename and scalefactor (in this order), and generates the data for the table tablename only, at the specified scalefactor.
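
The behaviour described in the list above can be illustrated with a small Python sketch. This is not the generator's actual implementation or API: the function names (tables_to_generate, convert, partition_ranges), the two-table SCHEMA subset, and the even row split across partitions are all assumptions made for illustration. It only shows how the two call forms select tables, how DATE and TIME values stay strings while other values are converted, and one plausible way to divide rows among partitions.

```python
from typing import List, Tuple, Union

# Hypothetical subset of the TPC-DS schema, for illustration only.
SCHEMA = {
    "call_center": [("cc_call_center_sk", "int"),
                    ("cc_rec_start_date", "date"),
                    ("cc_tax_percentage", "double")],
    "date_dim": [("d_date_sk", "int"),
                 ("d_date", "date"),
                 ("d_day_name", "string")],
}


def tables_to_generate(*args) -> Tuple[List[str], int]:
    """Mimic the two call forms: (scalefactor) or (tablename, scalefactor)."""
    if len(args) == 1:
        (scale_factor,) = args
        return list(SCHEMA), scale_factor          # all tables
    if len(args) == 2:
        table_name, scale_factor = args
        if table_name not in SCHEMA:
            raise ValueError(f"unknown table: {table_name}")
        return [table_name], scale_factor          # one table only
    raise TypeError("expected (scalefactor) or (tablename, scalefactor)")


def convert(raw: str, sql_type: str) -> Union[int, float, str]:
    """Convert a raw field to its type; DATE and TIME are kept as strings."""
    if sql_type == "int":
        return int(raw)
    if sql_type == "double":
        return float(raw)
    return raw  # "string", "date", and "time" all remain strings


def partition_ranges(total_rows: int, partitions: int) -> List[Tuple[int, int]]:
    """Split rows into contiguous half-open ranges, one per partition."""
    base, extra = divmod(total_rows, partitions)
    ranges, start = [], 0
    for p in range(partitions):
        size = base + (1 if p < extra else 0)      # spread the remainder
        ranges.append((start, start + size))
        start += size
    return ranges
```

For example, tables_to_generate(1) selects every table in the sketch's schema, while tables_to_generate("date_dim", 1) selects only date_dim.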


Scale Factor to Data Size Ratio

...