- RDD.cartesian(other)[source]#
Return the Cartesian product of this RDD and another one, that is, the RDD of all pairs of elements
(a, b)whereais in self andbis in other.New in version 0.7.0.
Examples
>>> rdd = sc.parallelize([1, 2]) >>> sorted(rdd.cartesian(rdd).collect()) [(1, 1), (1, 2), (2, 1), (2, 2)]
pyspark.RDD.cartesian — PySpark In-Progress documentation