Apache Spark
Content
About me
Distributed Computing at a High Level
Disk versus Memory based Systems
Spark Core
Brief background
Benchmarks and Comparisons
What is an RDD
RDD Actions and Transformations
Caching and Serialization
Anatomy of a Program
The Spark Family