1. Build HAWQ/PXF/Ranger RPM package
- Step 1: Prepare a HAWQ source code tarball for rpm building.
- a) Make sure the source code tarball name format is: apache-hawq-src-%{hawq_version}-incubating.tar.gz
wget -O apache-hawq-src-%{hawq_version}-incubating.tar.gz https://dist.apache.org/repos/dist/release/incubator/hawq/%{hawq_version}-incubating/apache-hawq-src-%{hawq_version}-incubating.tar.gz
- b) Make sure the extracted directory name format is: apache-hawq-src-%{hawq_version}-incubating
- c) Put the HAWQ source tarball in the %{top_dir}/contrib/hawq-package folder, or set "HAWQ_SOURCE_TARBALL_PATH" to specify where to find the HAWQ source tarball.
- Step 2: Configure the operating system and install dependency libraries.
- Step 3: Run 'make distclean' and './configure'.
- Step 4: Run 'make rpm' in the top directory of source code
- You can set environment variable "HAWQ_RELEASE_VERSION" to specify HAWQ version
- Step 5: You can find the HAWQ/PXF/Ranger plugin rpm tarball at:
- %{top_dir}/contrib/hawq-package/apache-hawq-rpm-%{hawq_version}-incubating.tar.gz
- Please refer to: https://github.com/apache/incubator-hawq/blob/master/contrib/hawq-package/README for details.
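The naming conventions used in the steps above can be captured in a few helper functions. This is a minimal sketch and not part of the HAWQ build system: the function names and the example version 2.2.0.0 are assumptions chosen for illustration.

```shell
#!/bin/sh
# Hypothetical helpers (not shipped with HAWQ) that derive the file names
# and paths described above from a version string such as 2.2.0.0.

# Name of the source tarball expected by the rpm build. $1 = version.
hawq_src_tarball() {
    printf 'apache-hawq-src-%s-incubating.tar.gz\n' "$1"
}

# Download URL of the source tarball on dist.apache.org. $1 = version.
hawq_src_url() {
    printf 'https://dist.apache.org/repos/dist/release/incubator/hawq/%s-incubating/%s\n' \
        "$1" "$(hawq_src_tarball "$1")"
}

# Path of the rpm tarball produced by 'make rpm'. $1 = version, $2 = top_dir.
hawq_rpm_tarball() {
    printf '%s/contrib/hawq-package/apache-hawq-rpm-%s-incubating.tar.gz\n' "$2" "$1"
}

# Print the derived names for the example version.
hawq_src_tarball 2.2.0.0
hawq_src_url 2.2.0.0
hawq_rpm_tarball 2.2.0.0 .
```

The functions only print names, so they can be combined with the commands above, for example: wget -O "$(hawq_src_tarball 2.2.0.0)" "$(hawq_src_url 2.2.0.0)".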
2. Install HAWQ/PXF/Ranger with RPM
Step 1: Configure the operating system following the instructions in Appendix 1.
Step 2: Install runtime dependency libraries following the instructions in Appendix 2.
Step 3: Install Apache Hadoop rpm packages using Bigtop.
Download bigtop repo as /etc/yum.repos.d/bigtop.repo from https://www-us.apache.org/dist/bigtop/bigtop-1.2.0/repos/centos7/
sudo wget -O /etc/yum.repos.d/bigtop.repo https://www-us.apache.org/dist/bigtop/bigtop-1.2.0/repos/centos7/bigtop.repo
Install the Hadoop components, including Hadoop, Hive, HBase and Tomcat. Note that the bigtop-tomcat version needs to be >= 6.0.
sudo yum install bigtop-tomcat hadoop\* hive\* hbase\*
Install Ranger.
sudo yum install ranger
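The bigtop-tomcat version requirement (>= 6.0) can be checked with a small version comparison. The following is a sketch under stated assumptions: the helper name version_ge is hypothetical, it relies on the version sort of GNU coreutils (sort -V), and the installed version is a literal example value so that the snippet runs on any machine.

```shell
#!/bin/sh
# version_ge A B: succeed if version A is greater than or equal to version B.
# Assumes GNU coreutils 'sort -V' (version sort) is available.
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# On a real host the installed version could be read with:
#   rpm -q --queryformat '%{VERSION}' bigtop-tomcat
# Here an example value is used instead (assumption for illustration).
installed="6.0.45"

if version_ge "$installed" "6.0"; then
    echo "bigtop-tomcat $installed satisfies >= 6.0"
else
    echo "bigtop-tomcat $installed is older than 6.0"
fi
```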
Step 4: Get HAWQ/PXF/Ranger RPM package
Download rpm binary release tarball from Apache HAWQ binary repository
wget -O apache-hawq-src-%{hawq_version}-incubating.tar.gz https://dist.apache.org/repos/dist/release/incubator/hawq/%{hawq_version}-incubating/apache-hawq-src-%{hawq_version}-incubating.tar.gz
Or copy it if you built it yourself
cp $HAWQ_SOURCE_DIR/contrib/hawq-package/apache-hawq-rpm-%{hawq_version}-incubating.tar.gz .
Step 5: Install HAWQ/PXF/Ranger with RPM package
Extract apache-hawq-rpm-%{hawq_version}-incubating.tar.gz and go to the directory ‘hawq_rpm_packages’
tar -xzvf apache-hawq-rpm-%{hawq_version}-incubating.tar.gz; cd hawq_rpm_packages
Install the rpm packages in the following order
rpm -ivh apache-hawq-2.2.0.0-el7.x86_64.rpm
rpm -ivh hawq-ranger-plugin-2.2.0.0-1.el7.centos.noarch.rpm
rpm -ivh apache-tomcat-7.0.62-el6.noarch.rpm
rpm -ivh pxf-service-3.2.1.0-1.el6.noarch.rpm
rpm -ivh pxf-hdfs-3.2.1.0-1.el6.noarch.rpm
rpm -ivh pxf-hive-3.2.1.0-1.el6.noarch.rpm
rpm -ivh pxf-hbase-3.2.1.0-1.el6.noarch.rpm
rpm -ivh pxf-jdbc-3.2.1.0-1.el6.noarch.rpm
rpm -ivh pxf-json-3.2.1.0-1.el6.noarch.rpm
rpm -ivh pxf-3.2.1.0-1.el6.noarch.rpm
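Because the packages have to be installed in a fixed order, it can help to keep that order in one place and preview the commands before running them. The sketch below is a dry run only: the variable and function names are hypothetical, the package file names are copied from the commands above, and nothing is installed.

```shell
#!/bin/sh
# Dry-run sketch: print the rpm commands in the documented order.
# HAWQ_RPM_ORDER and print_install_commands are illustrative names,
# not part of the HAWQ packaging.
HAWQ_RPM_ORDER="apache-hawq-2.2.0.0-el7.x86_64.rpm
hawq-ranger-plugin-2.2.0.0-1.el7.centos.noarch.rpm
apache-tomcat-7.0.62-el6.noarch.rpm
pxf-service-3.2.1.0-1.el6.noarch.rpm
pxf-hdfs-3.2.1.0-1.el6.noarch.rpm
pxf-hive-3.2.1.0-1.el6.noarch.rpm
pxf-hbase-3.2.1.0-1.el6.noarch.rpm
pxf-jdbc-3.2.1.0-1.el6.noarch.rpm
pxf-json-3.2.1.0-1.el6.noarch.rpm
pxf-3.2.1.0-1.el6.noarch.rpm"

# Emit one 'rpm -ivh' command per package, preserving the list order.
print_install_commands() {
    for pkg in $HAWQ_RPM_ORDER; do
        echo "rpm -ivh $pkg"
    done
}

print_install_commands
```

Once the printed commands have been reviewed, they can be run by hand from the hawq_rpm_packages directory.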
Appendix 1: Configure operating system
Configure system control parameters: edit /etc/sysctl.conf file to set parameters with values; then execute "sudo sysctl -p" to apply the changes.
kernel.shmmax = 1000000000
kernel.shmmni = 4096
kernel.shmall = 4000000000
kernel.sem = 250 512000 100 2048
kernel.sysrq = 1
kernel.core_uses_pid = 1
kernel.msgmnb = 65536
kernel.msgmax = 65536
kernel.msgmni = 2048
net.ipv4.tcp_syncookies = 0
net.ipv4.conf.default.accept_source_route = 0
net.ipv4.tcp_tw_recycle = 1
net.ipv4.tcp_max_syn_backlog = 200000
net.ipv4.conf.all.arp_filter = 1
net.ipv4.ip_local_port_range = 1281 65535
net.core.netdev_max_backlog = 200000
vm.overcommit_memory = 2
fs.nr_open = 3000000
kernel.threads-max = 798720
kernel.pid_max = 798720
# increase network
net.core.rmem_max=2097152
net.core.wmem_max=2097152
fs.file-max=3000000
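After editing the file, it can be useful to confirm that a given parameter is really set to the intended value. The following is a minimal sketch, assuming a sysctl.conf-style file of "key = value" lines; the function name sysctl_conf_value is hypothetical, and the demonstration runs against a temporary file rather than /etc/sysctl.conf.

```shell
#!/bin/sh
# sysctl_conf_value FILE KEY: print the value assigned to KEY in a
# sysctl.conf-style file, ignoring comment lines and surrounding whitespace.
# If KEY is assigned more than once, the last assignment is printed.
sysctl_conf_value() {
    awk -v key="$2" '
        /^[ \t]*#/ { next }                 # skip comment lines
        {
            pos = index($0, "=")
            if (pos == 0) next              # not an assignment
            k = substr($0, 1, pos - 1)
            v = substr($0, pos + 1)
            gsub(/^[ \t]+|[ \t]+$/, "", k)  # trim the key
            gsub(/^[ \t]+|[ \t]+$/, "", v)  # trim the value
            if (k == key) val = v
        }
        END { if (val != "") print val }
    ' "$1"
}

# Demonstration on a temporary file holding two of the settings above.
demo=$(mktemp)
printf '%s\n' '# sample settings' 'kernel.shmmni = 4096' 'net.core.rmem_max=2097152' > "$demo"
sysctl_conf_value "$demo" kernel.shmmni
sysctl_conf_value "$demo" net.core.rmem_max
rm -f "$demo"
```

To compare against the running kernel, the printed value can be checked against the output of sysctl -n for the same key.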
Configure system limits: edit the /etc/security/limits.conf file to set the parameters below in exactly the same order; then log in again for them to take effect.
* soft nofile 2900000
* hard nofile 2900000
* soft nproc 131072
* hard nproc 131072
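Since the new limits only apply to sessions started after the change, a quick check in a fresh login shell shows whether they took effect. This is a sketch only: the helper name limit_ok is hypothetical, and only the open-files limit (ulimit -n) is checked because the option for the process limit differs between shells.

```shell
#!/bin/sh
# limit_ok CURRENT REQUIRED: succeed if CURRENT (as printed by ulimit)
# is "unlimited" or a number greater than or equal to REQUIRED.
limit_ok() {
    [ "$1" = "unlimited" ] || [ "$1" -ge "$2" ]
}

# Compare the open-files limit of the current shell with the nofile value above.
current=$(ulimit -n)
if limit_ok "$current" 2900000; then
    echo "nofile limit OK: $current"
else
    echo "nofile limit too low: $current (log in again after editing limits.conf)"
fi
```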
Appendix 2: Install runtime dependency libraries
# Install epel
For CentOS 7:
rpm -ivh https://dl.fedoraproject.org/pub/epel/7/x86_64/e/epel-release-7-8.noarch.rpm
For RHEL 7:
rpm -ivh https://dl.fedoraproject.org/pub/epel/epel-release-latest-7.noarch.rpm

# For RHEL 7, enable rhel-7-server-extras-rpms and rhel-7-server-optional-rpms channel in /etc/yum.repos.d/redhat.repo
yum install -y libgsasl thrift boost protobuf java-1.7.0-openjdk-devel \
    python-pip krb5-server krb5-libs json-c net-snmp-libs

# Install tomcat6 if ranger plugin service is enabled
# yum install bigtop-tomcat

# Install pycrypto
pip --retries=50 --timeout=300 install pycrypto