Skip to content

Day 8 / Apr 13 (Wed)

Continuing the journey into the land of Spark Structured Streaming.

Morning Exercise

  1. Create a brand new project in IntelliJ IDEA
  2. Use rate data source as the source
  3. Use foreach to write data out
    1. Create a file for the records of a batch
    2. Open and close the file before and after processing the records
  4. Run the application from command line using spark-submit

Theory

  1. Joins
  2. web UI
Back to top