Lab: Hadoop and HDFS in a simulated production environment
Author
- Stéphane Derrode & Lamia Derrode, Centrale Lyon, Dpt Mathématiques & Informatique
Objectives
This lab follows the course on the open-source framework called Hadoop, developed and maintained by the Apache Foundation. The objective is to use Hadoop and HDFS in a simulated production environment, using a Docker container.
Step #1 : map-reduce, stand-alone mode.
Step #2 : Hadoop installation (using a pre-configured Docker container).
Step #3 : map-reduce, Hadoop cluster mode.
Exercise : For those who have the time, here is an exercise on big matrix-vector multiplication.