How MapReduce Works (in Hadoop)
Content
- Lifecycle of a MapReduce Job
- Components in a Hadoop MR Workflow
- Job Submission
- Initialization
- Scheduling
- Execution
- Map Task
- Sort Buffer
- Reduce Tasks
- Dealing with Failures and Slow Tasks
- HDFS Architecture
- Job Configuration Parameters
- Hadoop Job Configuration Parameters
- Tuning Hadoop Job Conf. Parameters
- Experimental Setting
- Parameters Varied in Experiments
- Hadoop 50GB TeraSort
- Automatic Optimization