Statistics are supported only for the following:

  1. Hive Metastore tables for which ANALYZE TABLE <tableName> COMPUTE STATISTICS noscan has been executed
  2. File-based data source tables, for which the statistics are computed directly from the data files
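
The following is a minimal sketch of how such statistics could be computed and inspected. It assumes a local SparkSession without Hive support (so the table lives in the default session catalog rather than a real Hive metastore), and the table name stats_demo is hypothetical.

```scala
import org.apache.spark.sql.SparkSession

// Assumption: a local session for illustration only.
val spark = SparkSession.builder()
  .master("local[*]")
  .appName("stats-demo")
  .getOrCreate()

// Hypothetical table: a small managed table backed by data files.
spark.range(100).write.mode("overwrite").saveAsTable("stats_demo")

// NOSCAN computes only the total size in bytes (no row count),
// so the data itself is not scanned.
spark.sql("ANALYZE TABLE stats_demo COMPUTE STATISTICS NOSCAN")

// The statistics that the query optimizer sees for this table.
val stats = spark.table("stats_demo").queryExecution.optimizedPlan.stats
println(s"sizeInBytes = ${stats.sizeInBytes}")

spark.stop()
```

DESCRIBE EXTENDED stats_demo should also list the stored size in its Statistics row.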

Broadcast Join

The Spark Planner can automatically select Broadcast Join when the estimated size of one side of a join (taken from the Statistics) does not exceed the spark.sql.autoBroadcastJoinThreshold configuration property.
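
As an illustration, the sketch below sets the threshold explicitly and prints the physical plan of a join between a large and a small dataset. It assumes a local SparkSession; the dataset sizes and the 10 MB threshold are arbitrary choices for the example.

```scala
import org.apache.spark.sql.SparkSession

// Assumption: a local session for illustration only.
val spark = SparkSession.builder()
  .master("local[*]")
  .appName("broadcast-demo")
  .getOrCreate()

// Threshold in bytes; a value of -1 disables automatic broadcast joins.
spark.conf.set("spark.sql.autoBroadcastJoinThreshold", 10L * 1024 * 1024)

val large = spark.range(1000000)
val small = spark.range(10) // estimated at 80 bytes, far below the threshold

val joined = large.join(small, "id")

// With the small side under the threshold, the plan is expected to
// use a broadcast join rather than a sort-merge join.
val planText = joined.queryExecution.executedPlan.toString
println(planText)

spark.stop()
```

Setting the property to -1 before building the join should make the planner fall back to a non-broadcast join strategy.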

Last update: 2021-05-03
