Spark 2.0 / Scala Workshop 2 Days

Jacek Laskowski / @jaceklaskowski / GitHub / Mastering Apache Spark Notes

https://github.com/jaceklaskowski

http://bit.ly/mastering-apache-spark

Among contributors to Apache Spark 1.6

Among contributors to Apache Spark 2

Agenda

  • Day 1. Scala and My First Scala Application
  • Day 1. Introduction to Apache Spark 2
  • Day 1. Developing Spark SQL Applications (in Scala)
  • Day 2. Developing Spark SQL Applications (in Scala)
  • Day 2. Using Spark MLlib and ML Pipelines

Prerequisities

  1. Working VirtualBox image (with the tools)
    • Java SE 8, Apache Spark 2, IntelliJ IDEA, sbt
  2. Some programming skills using modern programming language (preferably on JVM)
    • C#, Java, Python, Scala
  3. Willingness to ask PLENTY of questions

Questions?

- Read Mastering Apache Spark notes
- Follow @jaceklaskowski at twitter
- Use Jacek's projects at GitHub
- Visit Jacek Laskowski's blog