spark-workshop
Exercise: Working With Files On Hadoop HDFS
Download Apache Hadoop
Start a single-node HDFS cluster
Develop a Spark SQL application
Loads a file from HDFS into a DataFrame
Transforms the dataset
Saves the dataset to HDFS
Duration: 45 mins