Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Sqoop 1 datatypes are different for Java, Hive and Avro and most of the time the connector does the datatype mapping.
  • For Sqoop 2 we would like to standardize on the datatype support so that all connectors can conform for consistency.
  • Some datatypes discussed today were:
    • String, char: UTF 8 encoded
    • Timestamp: Unix time representation to millisecond level
    • Binary: byte array
    • Numeric: byte, short, int, long, float, double, byte
    • Null type

2. Intermediate data format

...