THIS IS A TEST INSTANCE. ALL YOUR CHANGES WILL BE LOST!!!!
...
- Tez apache page: http://incubator.apache.org/projects/tez.html
- Tez design doc: https://issues.apache.org/jira/browse/TEZ-65
- Tez presentation: http://www.youtube.com/watch?v=9ZLLzlsz7h8
- Tez slides: http://www.slideshare.net/Hadoop_Summit/murhty-saha-june26255pmroom212
...
The query (rewritten for hive):
Code Block | ||||
---|---|---|---|---|
| ||||
select i_item_desc |
...
,i_category |
...
,i_class |
...
,i_current_price |
...
,i_item_id |
...
,itemrevenue |
...
,itemrevenue*100/sum(itemrevenue) |
...
over (partition by i_class) as revenueratio |
...
from
(select
...
from (select i_item_desc |
...
,i_category |
...
,i_class |
...
,i_current_price |
...
,i_item_id |
...
,sum(ws_ext_sales_price) as |
...
from
web_sales
...
itemrevenue from web_sales join item on (web_sales.ws_item_sk = item.i_item_sk) |
...
join date_dim on (web_sales.ws_sold_date_sk = date_dim.d_date_sk) |
...
where
...
where i_category in ('1', '2', '3') |
...
and year(d_date) = 2001 and month(d_date) = |
...
group by
...
10 group by i_item_id |
...
,i_item_desc |
...
,i_category |
...
,i_class |
...
,i_current_price) tmp |
...
order |
...
i_category
...
by i_category ,i_class |
...
,i_item_id |
...
,i_item_desc |
...
,revenueratio; |
Plan with TEZ
Stage 0:
Local Work: Generate hash table for date dim
...