Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: change hive.optimize.tez to hive.execution.engine (HIVE-6103)

Table of Contents
inline
inline

Overview

Tez is a new application framework built on Hadoop Yarn that can execute complex directed acyclic graphs of general data processing tasks. In many ways it can be thought of as a more flexible and powerful successor of the map-reduce framework.

...

One new configuration variable will be introduced:

  • hive.optimize.tez 
    hive.execution.engine (changed in HIVE-6103)
    • True 
      tez: Submit native TEZ dags, optimized for MRR/MPJ
    • False 
      mr (default): Submit single map, single reduce plans

...

The query (rewritten for hive):

Code Block
sql
sql

select
  i_item_desc
  ,i_category
  ,i_class
  ,i_current_price
  ,i_item_id
  ,itemrevenue
  ,itemrevenue*100/sum(itemrevenue) over
    (partition by i_class) as revenueratio
from
  (select
     i_item_desc
     ,i_category
     ,i_class
     ,i_current_price
     ,i_item_id
     ,sum(ws_ext_sales_price) as itemrevenue
   from
     web_sales
     join item on (web_sales.ws_item_sk = item.i_item_sk)
     join date_dim on (web_sales.ws_sold_date_sk = date_dim.d_date_sk)
   where
     i_category in ('1', '2', '3')
     and year(d_date) = 2001 and month(d_date) = 10
   group by
     i_item_id
     ,i_item_desc
     ,i_category
     ,i_class
     ,i_current_price) tmp
order by
  i_category
  ,i_class
  ,i_item_id
  ,i_item_desc
  ,revenueratio;

...