THIS IS A TEST INSTANCE. ALL YOUR CHANGES WILL BE LOST!!!!
...
Configuring and Deploying Hadoop Services
...
Hadoop was configured and deployed in pseudo-distributed mode
hdfs-site.xml Configuration
Code Block |
---|
<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration> |
yarn-site.xml Configuration
Configuring and Deploying Tez
...
Running the Injector job on Tez
Run # | YARN Engine | # of URLs | Elapsed Time |
---|---|---|---|
1 | MapReduce | 11523 | 00:00:34 |
2 | MapReduce | 11523 | 00:00:32 |
3 | MapReduce | 11523 | 00:00:34 |
4 | Tez | 11523 | 00:00:42 |
5 | Tez | 11523 | 00:00:13 |
6 | Tez | 11523 | 00:00:14 |
7 | MapReduce | 15763469 | 00:03:21 |
8 | MapReduce | 15763469 | 00:03:13 |
9 | MapReduce | 15763469 | 00:02:38 |
10 | MapReduce | 15763469 | 00:02:37 |
11 | MapReduce | 15763469 | 00:02:48 |
12 | Tez | 15763469 | 00:02:14 |
13 | Tez | 15763469 | 00:02:10 |
14 | Tez | 15763469 | 00:02:13 |
From the above Tez clearly appears to offer significant runtime improvements over MapReduce. This is very promising however much more experimentation is required.
...