Lab: Hadoop map-reduce with MrJob library

Authors

Objectives

This lab uses the MrJob library to implement map-reduce algorithms that leverage sequences of map and reduce phases. We will first use this library locally, and then on the Hadoop cluster installed in the Docker container (see the previous lab).




Step #1: Install and test the MrJob library locally.

Step #2: Answer the exercises in this statement.

Step #3: Test a MrJob algorithm on the Hadoop cluster.