ExpressLane
MapR provides an express path (called ExpressLane) that works in conjunction with the Fair Scheduler. ExpressLane is for small MapReduce jobs to run when all slots are occupied by long tasks. ExpressLane is not available on YARN.
Small jobs are only given this special treatment when the cluster is busy, and only if they
meet the criteria specified by the following parameters in mapred-site.xml
:
Parameter | Value | Description |
---|---|---|
mapred.fairscheduler.smalljob.schedule.enable | true | Enable small job fast scheduling inside fair scheduler. TaskTrackers should reserve a slot called ephemeral slot, which is used for smalljob if the cluster is busy. |
mapred.fairscheduler.smalljob.max.reducers | 10 | Small job definition. Max number of reducers allowed in small job. |
mapred.fairscheduler.smalljob.max.reducer.inputsize | 1073741824 | Small job definition. Max estimated input size for a reducer allowed in small job. Default is 1GB per reducer. |
mapred.fairscheduler.smalljob.max.maps | 10 | Small job definition. Max number of maps allowed in small job. |
mapred.fairscheduler.smalljob.max.inputsize | 10737418240 | Small job definition. Max input size in bytes allowed for a small job. Default is 10GB. |
mapred.cluster.ephemeral.tasks.memory.limit.mb | 200 | Small job definition. Max memory in mbytes reserved for an ephermal slot. Default is 200mb. This value must be same on JobTracker and TaskTracker nodes. |
MapReduce jobs that appear to fit the small job definition but are in fact larger than anticipated are killed and re-queued for normal execution.