Performs an inner hash join of two child relations. When the output RDD of this operator is being constructed, a Spark job is asynchronously started to calculate the values for the broadcast relation. This data is then placed in a Spark broadcast variable. The streamed relation is not shuffled.
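The build-then-probe idea behind a broadcast hash join can be sketched in plain Scala collections, without any Spark dependency. This is only an illustration under stated assumptions: `BroadcastHashJoinSketch`, `join`, and the use of `Seq[Seq[...]]` to stand in for partitions are hypothetical names and simplifications, not the operator's actual code.

```scala
// Plain-Scala sketch; every name here is hypothetical and not part of Spark.
object BroadcastHashJoinSketch {
  // Inner join: build a hash table from the small (broadcast) side once,
  // then probe it while streaming each partition of the other side.
  def join[K, A, B](
      streamedPartitions: Seq[Seq[(K, A)]],
      broadcastSide: Seq[(K, B)]): Seq[Seq[(K, A, B)]] = {
    // Stands in for the value that would be held in a broadcast variable.
    val hashed: Map[K, Seq[B]] =
      broadcastSide.groupBy(_._1).map { case (k, rows) => k -> rows.map(_._2) }
    // Each streamed partition is processed where it is; no rows move
    // between partitions, which mirrors "the streamed relation is not shuffled".
    streamedPartitions.map { partition =>
      for {
        (k, a) <- partition
        b <- hashed.getOrElse(k, Seq.empty)
      } yield (k, a, b)
    }
  }
}
```

The output keeps the partition layout of the streamed side, since only the hash table is shared across partitions.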
:: DeveloperApi :: Performs a local hash join of two child relations. If a relation (from a data source) is already replicated across all nodes, then instead of performing a broadcast join, which can be expensive, this join scans the single partition of the replicated relation while streaming through the other relation.
Performs a hash join of two child relations by first shuffling the data using the join keys.
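A shuffle-then-join flow can be sketched as follows, again in plain Scala rather than against Spark's API. The names `ShuffledHashJoinSketch`, `shuffle`, and `join` are hypothetical, and hashing keys with `##` modulo the partition count is an assumed partitioning scheme chosen for illustration.

```scala
// Plain-Scala sketch; every name here is hypothetical and not part of Spark.
object ShuffledHashJoinSketch {
  // Redistribute rows so that equal keys always land at the same partition index.
  def shuffle[K, V](rows: Seq[(K, V)], numPartitions: Int): Vector[Seq[(K, V)]] = {
    val grouped = rows.groupBy { case (k, _) =>
      java.lang.Math.floorMod(k.##, numPartitions)
    }
    Vector.tabulate(numPartitions)(i => grouped.getOrElse(i, Seq.empty))
  }

  // Inner join: shuffle both sides on the join key, then hash-join
  // each pair of partitions that share an index.
  def join[K, A, B](
      left: Seq[(K, A)],
      right: Seq[(K, B)],
      numPartitions: Int): Seq[(K, A, B)] = {
    val l = shuffle(left, numPartitions)
    val r = shuffle(right, numPartitions)
    l.zip(r).flatMap { case (lp, rp) =>
      val built = rp.groupBy(_._1) // per-partition hash table
      lp.flatMap { case (k, a) =>
        built.getOrElse(k, Seq.empty).map { case (_, b) => (k, a, b) }
      }
    }
  }
}
```

Because both sides use the same partitioning function, matching rows are guaranteed to meet in the same partition pair, so no cross-partition lookups are needed.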
Base trait for joins used in SnappyData. Currently this allows children to have subsets of join keys as partitioning columns without introducing a shuffle.
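Why a subset of the join keys can be enough for partitioning is easiest to see with a small example: rows that agree on all join keys also agree on any subset of them, so they are already co-located. The sketch below assumes rows of the form `(a, b, payload)` joined on `(a, b)` while partitioned on `a` alone; `SubsetPartitioningSketch` and its methods are hypothetical names, not SnappyData code.

```scala
// Plain-Scala sketch; every name here is hypothetical and not part of SnappyData.
object SubsetPartitioningSketch {
  type Row = (Int, Int, String) // (a, b, payload); the join keys are (a, b)

  // Partition on column `a` only, which is a strict subset of the join keys.
  def partitionByA(rows: Seq[Row], n: Int): Vector[Seq[Row]] = {
    val grouped = rows.groupBy(r => java.lang.Math.floorMod(r._1, n))
    Vector.tabulate(n)(i => grouped.getOrElse(i, Seq.empty))
  }

  // Join each pair of co-located partitions on the full key (a, b).
  // No repartitioning on (a, b) is performed.
  def localJoin(
      left: Vector[Seq[Row]],
      right: Vector[Seq[Row]]): Seq[(Int, Int, String, String)] =
    left.zip(right).flatMap { case (lp, rp) =>
      for (l <- lp; r <- rp if l._1 == r._1 && l._2 == r._2)
        yield (l._1, l._2, l._3, r._3)
    }
}
```

Joining partition by partition here yields the same rows as a join over the unpartitioned inputs, which is the property that lets the shuffle be skipped.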
Extension to Spark's SortMergeJoinExec to avoid exchange for cases when join keys are a subset of child plan partitioning.
Performs a sort merge join of two child relations.
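As a rough illustration of the sort merge technique, the following plain-Scala sketch sorts both inputs by key and then advances two cursors in step. It assumes integer keys and in-memory sequences; `SortMergeJoinSketch` is a hypothetical name and the code does not reflect how the Spark operator handles spilling or code generation.

```scala
// Plain-Scala sketch; every name here is hypothetical and not part of Spark.
object SortMergeJoinSketch {
  def join[A, B](left: Seq[(Int, A)], right: Seq[(Int, B)]): Vector[(Int, A, B)] = {
    // Sort phase: both sides ordered by join key (sortBy is stable).
    val l = left.sortBy(_._1).toVector
    val r = right.sortBy(_._1).toVector
    val out = Vector.newBuilder[(Int, A, B)]
    var i = 0
    var j = 0
    // Merge phase: advance whichever cursor points at the smaller key.
    while (i < l.length && j < r.length) {
      val lk = l(i)._1
      val rk = r(j)._1
      if (lk < rk) i += 1
      else if (lk > rk) j += 1
      else {
        // Keys match: find the run of equal keys on each side.
        var iEnd = i
        while (iEnd < l.length && l(iEnd)._1 == lk) iEnd += 1
        var jEnd = j
        while (jEnd < r.length && r(jEnd)._1 == lk) jEnd += 1
        // Emit every pairing of the two runs, then skip past both.
        for (x <- i until iEnd; y <- j until jEnd)
          out += ((lk, l(x)._2, r(y)._2))
        i = iEnd
        j = jEnd
      }
    }
    out.result()
  }
}
```

After sorting, the merge makes a single forward pass over each side, revisiting rows only within a run of duplicate keys.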
An optimized CartesianRDD for UnsafeRow that caches the rows from the second child RDD. This is much faster than rebuilding the right partition for every row in the left RDD. It also materializes the right RDD, which matters when the right RDD is nondeterministic.
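The benefit of caching the right side can be shown with a tiny plain-Scala sketch. Here `computeRight` is a hypothetical stand-in for evaluating the right partition, and `CachedCartesianSketch` is an illustrative name; the real RDD works on UnsafeRow buffers rather than Scala sequences.

```scala
// Plain-Scala sketch; every name here is hypothetical and not part of Spark.
object CachedCartesianSketch {
  // `computeRight` models an expensive, possibly nondeterministic evaluation
  // of the right side. It is invoked exactly once and the result is reused
  // for every left row, so all left rows see the same right rows.
  def cartesian[A, B](left: Seq[A], computeRight: () => Seq[B]): Seq[(A, B)] = {
    val cachedRight = computeRight()
    for (a <- left; b <- cachedRight) yield (a, b)
  }
}
```

Evaluating `computeRight` inside the loop instead would recompute the right side once per left row and, for a nondeterministic source, could pair different left rows with different right rows.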
Physical execution operators for join operations.