## Warning: package 'dplyr' was built under R version 3.4.2
9/8 |
Introduction to distributed parallel processing and overview of applications in data analytics |
|
|
|
9/12 |
Processing data concurrently and in parallel |
JMM |
|
|
9/15 |
Reproducibility, Measurements, Performance evaluation |
HTDG 1-2 |
A0 |
|
9/19 |
Programming with Map Reduce |
MR04, HTDG 3-4 |
|
|
9/22 |
Programming with Map reduce |
HTDG 5-6 |
A1 |
|
9/26 |
Implementing Hadoop |
HTDG 7-8 |
|
A1 |
9/29 |
Implementing Hadoop |
MAS11, J+12, HTDG 9 |
A2 |
|
10/3 |
Beyond Map Reduce |
S14, SK12, R+12 |
|
A2 |
10/6 |
Beyond Map Reduce |
HTDG 16-18 |
A3 |
|
10/10 |
Data-parallel pipelines |
HTDG 18, FJ10 |
|
|
10/13 |
Midterm evaluation |
— |
A4 |
|
10/17 |
Scala: an Introduction |
O+04 |
|
A4 |
10/20 |
Spark: Basics |
HTDG 19, Z+12 |
A5 |
|
10/24 |
Spark: Basics |
KKWZ 1-6 |
|
A5 |
10/27 |
Spark: Accumulators and broadcast variables |
KKWZ 9, OS6 |
A6 |
|
10/31 |
Guest lecture: Working at a big data startup |
|
|
A6 |
11/3 |
Spark: Relational databases |
KKWZ 6 |
A7 |
|
11/7 |
Spark: SparkR |
A+15, KKWZ 7, M+16 |
|
A7 |
11/10 |
[ Veterans’ Day ] |
|
|
|
11/14 |
Spark: Scaling |
KKWZ 10, AD15 |
|
|
11/17 |
Spark: Streaming |
B+16, S+16 |
Project |
|
11/21 |
H2O |
|
Project |
|
11/24 |
[ Thanksgiving ] |
|
|
|
12/1 |
TensorFlow |
A+16 |
Project |
|
12/5 |
Final exam and Project Presentations |
|
Project |
|
12/8 |
Final exam and Project Presentations |
|
Project |
|