pyspark.RDD.stats — PySpark In-Progress documentation
RDD.stats()
Return a StatCounter object that captures the mean, variance
and count of the RDD’s elements in one operation.
New in version 0.9.1.
- Returns
StatCounter: a StatCounter capturing the mean, variance and count of all elements
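
Examples
To illustrate what computing the mean, variance and count "in one operation" can look like, here is a minimal pure-Python sketch. It is not PySpark's implementation: the `RunningStats` class and the `summarize` helper are hypothetical names introduced for this example, and the nested lists stand in for RDD partitions. The sketch assumes a Welford-style running update within each partition and a pairwise merge across partitions, so each element is visited only once.

```python
from dataclasses import dataclass


@dataclass
class RunningStats:
    """Hypothetical stand-in for a stats accumulator (not pyspark's StatCounter)."""
    n: int = 0        # number of elements seen so far
    mu: float = 0.0   # running mean
    m2: float = 0.0   # running sum of squared deviations from the mean

    def add(self, x: float) -> "RunningStats":
        # Welford-style update: fold one value in without a second pass.
        self.n += 1
        delta = x - self.mu
        self.mu += delta / self.n
        self.m2 += delta * (x - self.mu)
        return self

    def merge(self, other: "RunningStats") -> "RunningStats":
        # Combine two partial summaries, as a reduce step across partitions would.
        if other.n == 0:
            return self
        if self.n == 0:
            self.n, self.mu, self.m2 = other.n, other.mu, other.m2
            return self
        total = self.n + other.n
        delta = other.mu - self.mu
        self.m2 += other.m2 + delta * delta * self.n * other.n / total
        self.mu += delta * other.n / total
        self.n = total
        return self

    def variance(self) -> float:
        # Population variance (divides by n); undefined for an empty summary.
        return self.m2 / self.n if self.n else float("nan")


def summarize(partitions):
    """Summarize each partition locally, then merge the partial results."""
    total = RunningStats()
    for part in partitions:
        local = RunningStats()
        for x in part:
            local.add(x)
        total.merge(local)
    return total


# Two lists play the role of two partitions of one dataset.
result = summarize([[1.0, 2.0], [3.0, 4.0]])
print(result.n, result.mu, result.variance())
```

With a live SparkContext `sc` (assumed here, not created by the sketch), the analogous call would be `sc.parallelize([1.0, 2.0, 3.0, 4.0]).stats()`, and the returned StatCounter exposes the values through methods such as `count()`, `mean()` and `variance()`.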