November 08, 2016

Paper assignments for "large scale parallel processing" week

MapReduce: simplified data processing on large clusters (2008)

  • Muzammil A.
  • Jingjing R.
  • Connor Z.

DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language (2008)

  • Avanti P.
  • Sam C.

Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing (2012)

  • Fangfan L.
  • Abhilash M.
  • Kisalaya P.

Spark SQL: Relational Data Processing in Spark (2015)

  • Aviral G.
  • James L.
  • Nat D.