Spark Decoded

Repository for my literature survey. Code is stored in the master branch, website in gh-pages.

TODO

- Persist RDDs
- Concurrency
- Fault tolerance