Installing Apache Hadoop

Installing Java

Hadoop runs on both Unix and Windows operating systems, and requires Java to be
installed. For a production installation, you should select a combination of operating
system, Java, and Hadoop that has been certified by the vendor of the Hadoop distribution
you are using. There is also a page on the Hadoop wiki that lists combinations
that community members have run with success.
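You can verify that a suitable Java installation is on the path (assuming a JDK is already installed):
% java -version
Hadoop locates its Java installation through the JAVA_HOME environment variable, which can also be set in hadoop-env.sh.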

Creating Unix User Accounts

It’s good practice to create dedicated Unix user accounts to separate the Hadoop processes
from each other, and from other services running on the same machine. The
HDFS, MapReduce, and YARN services are usually run as separate users, named hdfs,
mapred, and yarn, respectively. They all belong to the same hadoop group.
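For example, on a typical Linux system the group and accounts can be created as follows (a minimal sketch; the exact commands and options vary by distribution):
% sudo groupadd hadoop
% sudo useradd -g hadoop hdfs
% sudo useradd -g hadoop mapred
% sudo useradd -g hadoop yarn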

Installing Hadoop
Download Hadoop from the Apache Hadoop releases page, and unpack the contents of
the distribution in a sensible location, such as /usr/local (/opt is another standard choice;
note that Hadoop should not be installed in a user’s home directory, as that may be an
NFS-mounted directory):
% cd /usr/local
% sudo tar xzf hadoop-x.y.z.tar.gz
You also need to change the owner of the Hadoop files to be the hadoop user and group:
% sudo chown -R hadoop:hadoop hadoop-x.y.z
It’s convenient to put the Hadoop binaries on the shell path too:
% export HADOOP_HOME=/usr/local/hadoop-x.y.z
% export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin
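To make these settings persist across logins, you can append them to your shell startup file (assuming bash; the single quotes keep the variables unexpanded in the file):
% echo 'export HADOOP_HOME=/usr/local/hadoop-x.y.z' >> ~/.bashrc
% echo 'export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin' >> ~/.bashrc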

Configuring SSH

The Hadoop control scripts use SSH to run commands on the cluster's machines, so set up SSH key-based login from the master to the workers. First, generate an RSA key pair:
% ssh-keygen -t rsa -f ~/.ssh/id_rsa
Next, make sure the public key is in the ~/.ssh/authorized_keys file on every machine you want to connect to (if home directories are shared over NFS, adding it once is enough):
% cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys

Test that you can SSH from the master to a worker machine: make sure ssh-agent
is running, then run ssh-add to store your passphrase. You should then be able to SSH
to a worker without entering the passphrase again.
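For example (worker001 is a placeholder for one of your worker hostnames):
% eval "$(ssh-agent -s)"   # start the agent if it isn't already running
% ssh-add                  # enter the passphrase once; the agent caches it
% ssh worker001 hostname   # should print the hostname without prompting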

Formatting the HDFS Filesystem

Before it can be used, a brand-new HDFS installation must be formatted. Formatting creates an empty filesystem by initializing the namenode's storage directories and persistent data structures (datanodes are not involved). Run the following as the hdfs user:
% hdfs namenode -format
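If you are logged in as another user, you can run the command as hdfs via sudo (a sketch, assuming sudo is configured to allow it):
% sudo -u hdfs hdfs namenode -format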

Starting and Stopping the Daemons

Start the HDFS daemons by running the following as the hdfs user:
% start-dfs.sh
Start the YARN daemons by running the following as the yarn user:
% start-yarn.sh
Finally, start the MapReduce job history server as the mapred user:
% mr-jobhistory-daemon.sh start historyserver
The machines on which start-dfs.sh starts namenode daemons are determined by the cluster configuration; you can list them with:
% hdfs getconf -namenodes
Each start script has a matching stop script (stop-dfs.sh, stop-yarn.sh) that stops the daemons its counterpart started.
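You can check which Hadoop daemons are running on a given machine with the jps tool that ships with the JDK:
% jps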

Creating User Directories
Once you have a Hadoop cluster up and running, you need to give users access to it.
This involves creating a home directory for each user and setting ownership permissions
on it:
% hadoop fs -mkdir /user/username
% hadoop fs -chown username:username /user/username
This is a good time to set space limits on the directory. The following sets a 1 TB limit
on the given user directory:
% hdfs dfsadmin -setSpaceQuota 1t /user/username
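You can confirm that the quota is in place with the count command:
% hadoop fs -count -q /user/username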

