Environment:
- CentOS 6
The following script deploys Bigtop Stack with Hadoop+Spark.
There's one thing you need to change manually:
- Replace bigtop1.apache.org to a hostname of your master node
The deployment script(Copy & Paste on every node in your cluster):
sudo rpm -ivh http://yum.puppetlabs.com/puppetlabs-release-el-6.noarch.rpm sudo yum -y install puppet sudo puppet module install puppetlabs-stdlib sudo git clone https://github.com/apache/bigtop.git /bigtop-home sudo cp -r /bigtop-home/bigtop-deploy/puppet/hieradata/ /etc/puppet/ sudo cp /bigtop-home/bigtop-deploy/puppet/hiera.yaml /etc/puppet/ # update configuration sudo su root -c "cat > /etc/puppet/hieradata/site.yaml << EOF --- ### This is very important to make sure this file doesn't contain any TAB ### symbols instead of spaces: the hiera YAML parser will choke to death with ### uncomprehensive error message and you'll waste a lot of time debugging it. bigtop::hadoop_head_node: "bigtop1.apache.org" hadoop::hadoop_storage_dirs: - /data/1 - /data/2 hadoop_cluster_node::cluster_components: - hadoop - spark bigtop::jdk_package_name: "java-1.8.0-openjdk-devel.x86_64" bigtop::bigtop_repo_uri: "http://bigtop-repos.s3.amazonaws.com/releases/1.2.0/centos/6/x86_64" EOF " sudo puppet apply --parser future --modulepath=/bigtop-home/bigtop-deploy/puppet/modules:/etc/puppet/modules /bigtop-home/bigtop-deploy/puppet/manifests
To customize your own Big Data Stack:
hadoop_cluster_node::cluster_components: - hadoop - spark - hbase ...