Spark and Scala Workshop for Developers (1 Day)
What You Will Learn (aka Goals)
This one-day Spark and Scala workshop is supposed to give you a practical and hands-on introduction to the architecture of Apache Spark and the tools that you use to develop Spark applicatons in your Big Data projects.
The workshop uses an intense code-first approach in which the modules start with just enough knowledge to get you going (using slides, scaladoc and live coding) and quickly move on to applying the concepts in programming assignments. There are a lot of them.
The programming language to use during the course is Scala.
Your dear drill sergeant, Jacek.
Duration
1 day / 8 hours / 9:00 AM - 5:00 PM
Agenda
- Course Introduction - 10 mins
- Introduction to Scala, sbt, and IntelliJ IDEA for Spark Developers - 70 mins
- Break - 10 mins
- Introduction to Apache Spark using spark-shell and web UI - 90 mins
- Lunch Break - 45 mins
- My first Spark SQL standalone application in Scala with spark-submit - 60 mins
- Deploying Spark applications to Spark Standalone cluster - 30 mins
- Break - 10 mins
- Functions and Operators in Spark SQL - 60 mins
- Break - 10 mins
- Developing Machine Learning applications using Spark MLlib - 90 mins
Target Audience
- Software developers who want to learn the Scala API for Apache Spark
- Software developers with programming experience in a similar general-purpose programming language (Java, Python, Ruby)
Requirements
- Training classes are best for groups up to 8 participants.
- FIXME: Link to the other courses wherever possible and make it consistent