Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

2 rows affected (14.802 seconds)


0: jdbc:hive2://localhost:10000> select count(*) from default.emr_clone;

INFO  : Compiling command(queryId=ngangam_20240129182647_7544c9d1-c68b-4a34-b6b0-910945a1dba5): select count(*) from default.emr_clone

INFO  : Semantic Analysis Completed (retrial = false)

INFO  : Created Hive schema: Schema(fieldSchemas:[FieldSchema(name:_c0, type:bigint, comment:null)], properties:null)

INFO  : Completed compiling command(queryId=ngangam_20240129182647_7544c9d1-c68b-4a34-b6b0-910945a1dba5); Time taken: 6.282 seconds

INFO  : Concurrency mode is disabled, not creating a lock manager

INFO  : Executing command(queryId=ngangam_20240129182647_7544c9d1-c68b-4a34-b6b0-910945a1dba5): select count(*) from default.emr_clone

WARN  : Hive-on-MR is deprecated in Hive 2 and may not be available in the future versions. Consider using a different execution engine (i.e. tez, impala) or using Hive 1.X releases.

INFO  : Query ID = ngangam_20240129182647_7544c9d1-c68b-4a34-b6b0-910945a1dba5

INFO  : Total jobs = 1

INFO  : Launching Job 1 out of 1

INFO  : Starting task [Stage-1:MAPRED] in serial mode

INFO  : Number of reduce tasks determined at compile time: 1

INFO  : In order to change the average load for a reducer (in bytes):

INFO  :   set hive.exec.reducers.bytes.per.reducer=<number>

INFO  : In order to limit the maximum number of reducers:

INFO  :   set hive.exec.reducers.max=<number>

INFO  : In order to set a constant number of reducers:

INFO  :   set mapreduce.job.reduces=<number>

INFO  : number of splits:1

INFO  : Submitting tokens for job: job_local1913531390_0004

INFO  : Executing with tokens: []

INFO  : The url to track the job: http://localhost:8080/

INFO  : Job running in-process (local Hadoop)

INFO  : 2024-01-29 18:26:55,764 Stage-1 map = 100%,  reduce = 100%

INFO  : Ended Job = job_local1913531390_0004

INFO  : MapReduce Jobs Launched:

INFO  : Stage-Stage-1:  HDFS Read: 0 HDFS Write: 0 SUCCESS

INFO  : Total MapReduce CPU Time Spent: 0 msec

INFO  : Completed executing command(queryId=ngangam_20240129182647_7544c9d1-c68b-4a34-b6b0-910945a1dba5); Time taken: 2.344 seconds

INFO  : OK

+------+

| _c0  |

+------+

| 2    |

+------+

1 row selected (8.795 seconds)

0: jdbc:hive2://localhost:10000>


5. To fetch data from the remote tables, run SELECT queries using column spec and predicates as you would
normally with any SQL tables.

0: jdbc:hive2://localhost:10000> select * from test_emr_tbl where tblkey > 1;

INFO  : Compiling command(queryId=ngangam_20240129191217_79b9e874-197d-4c31-8164-1ec2397bbff7): select * from test_emr_tbl where tblkey > 1

INFO  : Completed compiling command(queryId=ngangam_20240129191217_79b9e874-197d-4c31-8164-1ec2397bbff7); Time taken: 6.219 seconds

INFO  : Compiling command(queryId=ngangam_20240129191217_79b9e874-197d-4c31-8164-1ec2397bbff7): select * from test_emr_tbl where tblkey > 1

INFO  : Semantic Analysis Completed (retrial = false)

INFO  : Created Hive schema: Schema(fieldSchemas:[FieldSchema(name:test_emr_tbl.tblkey, type:int, comment:null), FieldSchema(name:test_emr_tbl.descr, type:string, comment:null)], properties:null)

INFO  : Completed compiling command(queryId=ngangam_20240129191217_79b9e874-197d-4c31-8164-1ec2397bbff7); Time taken: 1.364 seconds

INFO  : Concurrency mode is disabled, not creating a lock manager

INFO  : Executing command(queryId=ngangam_20240129191217_79b9e874-197d-4c31-8164-1ec2397bbff7): select * from test_emr_tbl where tblkey > 1

INFO  : Completed executing command(queryId=ngangam_20240129191217_79b9e874-197d-4c31-8164-1ec2397bbff7); Time taken: 0.001 seconds

INFO  : OK

+----------------------+---------------------+

| test_emr_tbl.tblkey  | test_emr_tbl.descr  |

+----------------------+---------------------+

| 2                    | test 2              |

+----------------------+---------------------+

1 row selected (8.238 seconds)


6. Join with local hive tables, run SELECT queries joining multiple tables (local or remote) as you would
normally with any SQL tables.