Exercise: Working with Datasets Using JDBC and PostgreSQL
Develop a standalone Spark SQL application that reads data from a JDBC database, e.g. PostgreSQL.
Module: Spark SQL
Duration: 30 mins
Steps
- spark-submit –packages to submit Spark application with PostgreSQL JDBC Driver
- Use Spark Standalone cluster
- Troubleshoot missing PostgreSQL JDBC Driver jar
Useful Links
- Working with Datasets from JDBC Data Sources (and PostgreSQL)