Day 2 / Apr 5 (Tue)¶
Introduction: Spark SQL¶
Next are 4 days to learn Spark SQL with Scala and SQL
- We know Scala already
- SQL is a brand new language to learn
Morning Exercise / Code Review¶
- Select one or more solutions for a code review
- Clone the repo
- Execute tests (
sbt test
) - Import the project to IntelliJ IDEA
- Take notes about interesting parts of the code
- Be ready with questions to the author
- (advanced) If you find a bug, write a failing test
- Time: 45 mins
Self-Study: Reading Official Documentation¶
- Spark Overview (up to Running the Examples and Shell)
- Submitting Applications
- Spark SQL, DataFrames and Datasets Guide
- The scaladoc of Dataset
Demo: Writing Your First Spark SQL Application¶
Based on Getting Started.