Install Sqoop2
About this task
Sqoop2 includes two packages:
- The client package,
mapr-sqoop2-client
(install on each node that will act as a client) - The server package,
mapr-sqoop2-server
(install on at least one node in the cluster)
The Sqoop2 server also acts as a MapReduce client, so each node where you install the
mapr-sqoop2-server
package must also have Hadoop installed and
configured. If you have a lot of client users, you can install multiple servers on multiple
nodes, so as not to overload a single server. WARNING: Sqoop2 packages cannot
be installed on the same nodes as Sqoop1 packages. However, you can use both versions in
the same Hadoop cluster by installing Sqoop1 and Sqoop2 on different nodes.
Execute the following commands as root
or using sudo:
Procedure
-
Update the list of available packages:
- Ubuntu
-
apt-get update
- RedHat and CentOS
-
yum clean all
-
On each Sqoop2 server node, install
mapr-sqoop2-server
:- On Ubuntu
-
apt-get install mapr-sqoop2-server
- On RedHat and CentOS
-
yum install mapr-sqoop2-server
-
On each Sqoop2 client node, install
mapr-sqoop2-client
:- On Ubuntu
-
apt-get install mapr-sqoop2-client
- On RedHat and CentOS
-
yum install mapr-sqoop2-client
-
If you installed Sqoop2 1.99.3 (version prior to the 1507 release) and the cluster
runs in YARN mode (MRv2), update the path to the Hadoop configuration directory in the
sqoop.properties
file: The correct path for YARN is as follows. By default thesqoop.properties
file contains the path for MRv1.NOTE: As of Sqoop2 1.99.6-1507, Sqoop2 automatically sets the Hadoop configuration directory based on the Hadoop version that is used by the cluster./opt/mapr/sqoop/sqoop-2.0.0/server/conf/sqoop.properties
# Hadoop configuration directory org.apache.sqoop.submission.engine.mapreduce.configuration.directory=/opt/mapr/hadoop/hadoop-2.x.x/etc/hadoop/
-
Start each Sqoop server node.
maprcli node services -name sqoop2 -action start -nodes <space delimited list of nodes>