Spark SQL 2.2
Workshop 3 Days

@jaceklaskowski / StackOverflow / GitHub
Books: Mastering Apache Spark / Spark Structured Streaming

https://github.com/jaceklaskowski

https://bit.ly/mastering-apache-spark

Ranked #89 in Spark contributors

http://stackoverflow.com/users/1305344/jacek-laskowski

https://twitter.com/jaceklaskowski

Goal

Developing hands-on experience in
Spark SQL
to exploit massive datasets at rest
for advanced analytics and data-oriented decision making

Mastering Apache Spark notebook featured in the Big Data course at Coursera!

Mastering Apache Spark featured in Big Data course at Coursera

Prerequisities

  1. Some programming experience using modern programming language, e.g. Scala, Python, Java, F#
  2. Installed: Java Platform, Standard Edition (Java SE) 8
  3. Downloaded: Apache Spark 2.2.0
    • Pre-built for Apache Hadoop 2.7 and later
    • spark-2.2.0-bin-hadoop2.7.tgz