Configuring MapReduce Mode for Ecosystem Clients

For Ecosystem clients that directly submit MapReduce jobs to the cluster, the MapReduce mode will default to the MapReduce mode of the client node from which the job is submitted.

In the following figure, Pig is installed on a client node with default_mode=yarn. Pig directly submits the application to the ResourceManager in the cluster.

When the ecosystem component connects directly to the cluster (not through a service), you can set the MapReduce version for the job or application in an environment variable. The mode that you set in an environment variable overrides the default MapReduce mode that is set on the client node and the cluster.