Skip to content

AggregateInPandasExec Physical Operator

AggregateInPandasExec is a unary physical operator that...FIXME

Creating Instance

AggregateInPandasExec takes the following to be created:

AggregateInPandasExec is created when Aggregation execution planning strategy is executed (for Aggregate logical operators with PythonUDF aggregate expressions only).

Executing Operator

doExecute(): RDD[InternalRow]

doExecute uses ArrowPythonRunner (one per partition) to execute PythonUDFs.

doExecute is part of the SparkPlan abstraction.

Last update: 2020-10-02