During our course we will learn how to process data with
We will learn to store out data in HDFS and MongoDB.
The course will be using the following 2 languages
You can submit your assignments in any of those!
| Week | Lecture | Who? | Topic | Teacher |
|---|---|---|---|---|
| 13/11 | 1 | All | Course introduction, Big and Fast data, Intro to course PLs | GG |
| 13/11 | 2 | All | Programming for Big Data (1) | GG |
| 20/11 | 1 | All | Programming for Big Data (2) | GG |
| 20/11 | 2 | All | Distributed Systems | GG |
| 27/11 | 1 | All | Distributed Databases | GG |
| 27/11 | 2 | All | Map/Reduce and Hadoop | GG |
| 4/12 | 1 | All | Spark: RDDs and Pair RDDs | GG |
| 4/12 | 2 | All | Spark Internals | JR |
| Week | Lecture | Who? | Topic | Teacher |
|---|---|---|---|---|
| 11/12 | 1 | All | Spark SQL, Synonyms with Word2Vec | GG |
| 11/12 | 2 | All | Recommending bands, Predicting pull request merges | GG |
| 18/12 | 1 | BSc | Stream processing | AK |
| 18/12 | 2 | BSc | Stream processing systems | AK |
| 18/12 | 1 | Minors | Introduction to Data Science | GG |
| 18/12 | 2 | Minors | Introduction to Data science | GG |
| 8/1 | 1 | All | Big Graphs (Maybe) | GG |
| 8/1 | 2 | All | Graph processing systems (Maybe) | GG |
There will be resit, there is no mid-term.
You can transfer your assignment grade AS A WHOLE. No individual assignment resubmissions!
The course consists of 4 obligatory assignments, and 1 optional one.
You always work in groups
We will be using Jupyter notebooks to submit the assignments.
Grade: \(\frac{\sum_{assign=1}^{4} grade(assign)}{4}\), aka the mean
Most assignment grades add to \(> 10\)
Deadlines for each assignment are on the course’s web site
One day before the deadline (before 23:59) you must:
On the deadline date
All assignments are done in pairs
Generally, two types of Wednesdays
The TAs will add everybody to a Slack channel
bdp1.ewi.tudelft.nl from within TU Delft or VPN.Q A question with a known answer; this will be revealed, but we should work together towards it!
D A open discussion item; we need to think and discuss.
Freely available on the web, on my homepage (http://gousios.org/teaching.html)
You can print/download them before the lecture and bring them along to make additional notes.
I am looking forward to improve them! If you have suggestions etc, let me know!
This work is (c) 2017 - onwards by TU Delft and Georgios Gousios and licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International license.