Apache Spark 2 Workshop 4 days

@jaceklaskowski / StackOverflow / GitHub
Gitbooks: Mastering Apache Spark / Spark Structured Streaming

https://github.com/jaceklaskowski

https://bit.ly/mastering-apache-spark

Ranked #91 in Spark contributors

https://stackoverflow.com/tags/apache-spark/topusers

https://twitter.com/jaceklaskowski

Agenda

  • Day 1aJust Enough Scala and Tools (sbt, IntelliJ IDEA)
  • Day 1bSpark SQL and Tools (spark-shell, Databricks)
  • Day 2Advanced Spark SQL
  • Day 3aSpark Architecture and Core Concepts
  • Day 3bMonitoring using web UI
  • Day 3cSpark Streaming + Kafka
  • Day 4aStructured Streaming (Kafka, files)
  • Day 4bSpark MLlib

Mastering Apache Spark notebook featured in the Big Data course at Coursera!

Mastering Apache Spark featured in Big Data course at Coursera

Prerequisities (1 of 3)

  1. Some programming experience using modern programming language, e.g. Scala, Python, Java, F#

Prerequisities (2 of 3)

  1. Installed

Prerequisities (3 of 3)

  1. Downloaded

Questions?