Partitions correspond to Hadoop's splits (if the data lives on HDFS) or to the partitioning scheme of the source storage.
An RDD (and hence the data inside it) is partitioned.
Spark manages data using partitions, which help parallelize distributed data processing while keeping the network traffic for sending data between executors to a minimum.
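As a minimal sketch of how partitioning shows up in the RDD API, the snippet below creates an RDD with an explicit number of partitions and then merges them. The object name PartitionCountSketch, the helper partitionCounts, the local[4] master and the sizes 8 and 2 are assumptions chosen only for illustration.

```scala
import org.apache.spark.sql.SparkSession

object PartitionCountSketch {
  // Returns (partitions requested at creation, partitions after coalesce).
  def partitionCounts(spark: SparkSession): (Int, Int) = {
    // Ask for 8 partitions explicitly when turning a local collection into an RDD.
    val numbers = spark.sparkContext.parallelize(1 to 100, numSlices = 8)
    // coalesce merges existing partitions; with the default shuffle = false
    // it avoids a full shuffle of the data between executors.
    val merged = numbers.coalesce(2)
    (numbers.getNumPartitions, merged.getNumPartitions)
  }

  def main(args: Array[String]): Unit = {
    // local[4] runs Spark in-process with 4 worker threads (demo assumption).
    val spark = SparkSession.builder()
      .master("local[4]")
      .appName("partition-count-sketch")
      .getOrCreate()
    val (initial, merged) = partitionCounts(spark)
    println(s"initial partitions: $initial, after coalesce: $merged")
    spark.stop()
  }
}
```

For data read from storage (for example with textFile), the number of partitions is typically derived from the input splits rather than set by hand.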
Data in partitions can be skewed, i.e. unevenly distributed across partitions.
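One way to observe skew is to count the records in each partition. The sketch below builds a deliberately lopsided key-value dataset and hash-partitions it by key. The object name PartitionSkewSketch, the helper partitionSizes, the key names and the 90/10 split are all made up for the example.

```scala
import org.apache.spark.HashPartitioner
import org.apache.spark.sql.SparkSession

object PartitionSkewSketch {
  // Hash-partitions the records by key and returns the record count of each partition.
  def partitionSizes(spark: SparkSession, records: Seq[(String, Int)], numPartitions: Int): Array[Int] = {
    val byKey = spark.sparkContext
      .parallelize(records)
      .partitionBy(new HashPartitioner(numPartitions))
    // Count inside each partition so that only the counts travel to the driver.
    byKey.mapPartitionsWithIndex((_, rows) => Iterator(rows.size)).collect()
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[4]")
      .appName("partition-skew-sketch")
      .getOrCreate()
    // 90 records share the key "hot"; the other 10 have distinct keys.
    val records = Seq.fill(90)(("hot", 1)) ++ (1 to 10).map(i => (s"key-$i", 1))
    partitionSizes(spark, records, 4).zipWithIndex.foreach { case (n, idx) =>
      println(s"partition $idx: $n records")
    }
    spark.stop()
  }
}
```

Because all records with the same key land in the same partition, one partition holds at least 90 of the 100 records here, so the task processing it does most of the work.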
RDD Operators
A transformation is a lazy RDD operation that creates one or more new RDDs.
An action is an RDD operation that produces non-RDD Scala values.
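The difference between the two kinds of operators can be sketched with a counter that records how often the function passed to map actually runs. The object name LazyEvaluationSketch, the helper run and the accumulator name are hypothetical. Using an accumulator inside a transformation is only a demonstration device, since Spark may re-execute tasks and count an update more than once.

```scala
import org.apache.spark.sql.SparkSession

object LazyEvaluationSketch {
  // Returns (map calls seen before the action, result of the action, map calls seen after it).
  def run(spark: SparkSession): (Long, Int, Long) = {
    val sc = spark.sparkContext
    val calls = sc.longAccumulator("map calls")

    // Transformation: only describes the computation and returns a new RDD[Int].
    val doubled = sc.parallelize(1 to 5).map { n => calls.add(1L); n * 2 }
    val before: Long = calls.value // still 0, no job has run yet

    // Action: triggers the job and returns a plain Scala Int to the driver.
    val total: Int = doubled.reduce(_ + _)
    val after: Long = calls.value
    (before, total, after)
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[2]")
      .appName("lazy-evaluation-sketch")
      .getOrCreate()
    val (before, total, after) = run(spark)
    println(s"map calls before action: $before") // 0
    println(s"result of reduce: $total")         // 30
    println(s"map calls after action: $after")   // 5 in this local run
    spark.stop()
  }
}
```

Here map is the transformation (RDD in, RDD out) and reduce is the action (RDD in, Scala value out).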