https://bit.ly/mastering-apache-spark
Please DON'T do this!
sc.textFile("...CSV") <-- HERE
.filter(lambda l : "ADD_NUM" not in l)
.map(parseCultureLoc)
sc.textFile("...TXT") <-- HERE
.map(parseTwitter)
.filter(lambda t : "toronto" in t['place'].lower()))
sc.parallelize(grid)
Please DON'T do this!
wdat <- read.df(sqlContext,
"./ParkingTicket/weather/201*",
source = "com.databricks.spark.csv", <-- this
inferSchema = "true",
header="true", skiprows=16)
sqlContext.jsonFile(file_path1)
Please DON'T do this!
Please DON'T do this!
parking_2015_withTrial_df
.coalesce(1) <------------------------- HERE
.write
.format("com.databricks.spark.csv") <-- HERE
.options(header="true")
.save("/mnt/%s/Parking/Parking_Tags_2015_and_other_data" % MOUNT_NAME)
Please DON'T do this!
Nothing here...you're on your own :-)
I did learn a few things from YOU! Thanks!