Apache Spark 2 Workshop (5 half-days)

@jaceklaskowski / StackOverflow / GitHub
Notebooks: Mastering Apache Spark / Spark Structured Streaming

https://github.com/jaceklaskowski

https://bit.ly/mastering-apache-spark

Ranked #96 in Spark contributors

http://stackoverflow.com/users/1305344/jacek-laskowski

https://twitter.com/jaceklaskowski

Agenda

  • Day 1: Just Enough Scala and Tools (sbt, IntelliJ IDEA)
  • Day 2: Spark SQL and Tools (spark-shell, Databricks)
  • Day 3a: Advanced Spark SQL
  • Day 3b: Monitoring using web UI
  • Day 4: Spark MLlib
  • Day 5a: Structured Streaming (files)
  • Day 5b: Structured Streaming with Apache Kafka

Mastering Apache Spark notebook featured in the Big Data course at Coursera!

Prerequisites (1 of 3)

  1. Some programming experience with a modern programming language, e.g. Scala, Python, Java, or F#

Prerequisites (2 of 3)

  1. Installed

Prerequisites (3 of 3)

  1. Downloaded

Questions?