spark-workshop

Exercise: Streaming CSV Datasets

Develop a standalone Spark Structured Streaming application (using IntelliJ IDEA) that runs a streaming query that loads CSV files and prints their content out to the console.

The query should use csv streaming source and console streaming sink.

Use sbt package and spark-submit to run the application.

Module: Spark Structured Streaming

Duration: 30 mins