Content for Chapter 1, Big Data Technologies
Here is a detailed content of class sessions for Chapter 1 of module Big Data computing technologies (4th semester), BSC in Data Science for Responsible Business (Centrale Lyon & EM Lyon).
Part 1. Linked Open Data (LOD) technology and project.¶
-
Teaching materials
- Slides
- Query examples with examples from the course (and other examples).
-
Educational resources
-
Book (available at the Centrale Lyon library)
- Learning SparQL, by Bob Ducharme, 2nd Edition, 2011, O’Reilly. (pdf copies can be found on the Internet!)
-
SparQL language reference
- SparQL language from W3C.
- SparQL By Example: The Cheat Sheet.
- SparQL query-validator. As a bonus, it re-indents and improves the readability of your code!
-
Videos
- Big Data in 5 minutes
- What is Linked Open Data? (Introduction for students)
- What is Linked Data ? (A short non-technical introduction to Linked Data)
- SPARQL in 11 minutes
-
-
Practical work
-
LOD Project (in groups of 3 students)
Part 2. Hadoop framework¶
-
Teaching materials
-
Educational resources
-
Videos
- Hadoop In 5 Minutes
- What Is Hadoop? . 30 minutes introduction for beginners
- HDFS Tutorial For Beginners. 43 minutes
-
-
Practical works