Lab: Hadoop map-reduce with MrJob library
Authors
- Stéphane Derrode & Lamia Derrode, Centrale Lyon, Dpt Mathématiques & Informatique
Objectives
This lab uses the MrJob library to implement map-reduce algorithms that leverage sequences of map and reduce phases. We will first use this library locally, and then on the Hadoop cluster installed in the Docker container (see the previous lab).
Step #1: Install and test the MrJob library locally.
Step #2: Answer the exercises in this statement.
Step #3: Test a MrJob algorithm on the Hadoop cluster.