Spark 2.0 / Scala Workshop Day 2

Jacek Laskowski / @jaceklaskowski / GitHub / Mastering Apache Spark Notes

Agenda - Day 2 (1 of 2)

  • Introduction to Spark SQL
  • SparkSession — The Entry Point
  • DataSource API — Loading and Writing Datasets
  • Exercise: My First Spark SQL Application (using IntelliJ IDEA)
  • Spark Architecture and Cluster Managers
  • Exercise: spark-submit Your Spark App / run-example SparkPi

Agenda - Day 2 (2 of 2)

  • Using Functions and Operators in Spark SQL
  • Exercise: Executing Queries from Command-Line / CSV
  • Web UI
  • Scala functions and UDFs in Spark SQL
  • Exercise: Scala functions and UDFs

Questions?