Copying Data from Apache Hadoop
You can use the hdfs protocol, webhdfs protocol, or NFS to copy data from Apache Hadoop to a MapR cluster.
The following table provides descriptions of the methods:
Method | Description |
---|---|
hdfs:// protocol | You can use the hadoop distcp command with the
hdfs:// protocol to copy data from an HDFS cluster into a MapR
cluster if the HDFS cluster and the MapR cluster use the same RPC protocol version.
For all other scenarios, use the webhdfs:// protocol or NFS gateway
to copy data to a MapR cluster. |
webhdfs:// protocol | You can use the hadoop
distcp command with the webhdfs:// protocol to
copy data from an HDFS cluster into a MapR cluster. |
NFS | You can mount a MapR cluster to an HDFS cluster via NFS mount and then
use the hadoop distcp command to copy data between the two
clusters. |
Refer to the following sections for information about how to copy data from Hadoop to a MapR cluster: