Instructor: Heather Miller heather at ccs dot neu dot edu WVH242 (temp) & WVH302D Fall 2016 Thursdays 6pm-9pm Behrakis Center Room 204
November 08, 2016
MapReduce: simplified data processing on large clusters (2008)
DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language (2008)
Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing (2012)
Spark SQL: Relational Data Processing in Spark (2015)