## Warning: package 'dplyr' was built under R version 3.4.2
Date Topic Reading Assignments Code.reviews
9/8 Introduction to distributed parallel processing and overview of applications in data analytics
9/12 Processing data concurrently and in parallel JMM
9/15 Reproducibility, Measurements, Performance evaluation HTDG 1-2 A0
9/19 Programming with Map Reduce MR04, HTDG 3-4
9/22 Programming with Map reduce HTDG 5-6 A1
9/26 Implementing Hadoop HTDG 7-8 A1
9/29 Implementing Hadoop MAS11, J+12, HTDG 9 A2
10/3 Beyond Map Reduce S14, SK12, R+12 A2
10/6 Beyond Map Reduce HTDG 16-18 A3
10/10 Data-parallel pipelines HTDG 18, FJ10
10/13 Midterm evaluation — A4
10/17 Scala: an Introduction O+04 A4
10/20 Spark: Basics HTDG 19, Z+12 A5
10/24 Spark: Basics KKWZ 1-6 A5
10/27 Spark: Accumulators and broadcast variables KKWZ 9, OS6 A6
10/31 Guest lecture: Working at a big data startup A6
11/3 Spark: Relational databases KKWZ 6 A7
11/7 Spark: SparkR A+15, KKWZ 7, M+16 A7
11/10 [ Veterans’ Day ]
11/14 Spark: Scaling KKWZ 10, AD15
11/17 Spark: Streaming B+16, S+16 Project
11/21 H2O Project
11/24 [ Thanksgiving ]
12/1 TensorFlow A+16 Project
12/5 Final exam and Project Presentations Project
12/8 Final exam and Project Presentations Project