012-Spark RDDs
From the beginning, the foundation of Spark was the Resilient Distributed Dataset, or RDD. Understanding RDDs is essential to using Spark, and it opens the door to DataFrames, MLlib, Spark SQL, and more.
Comments: 9
Thanks, way more useful than my 2 hour lecture!
Thank you so much, this made things so much clearer for me! I wish you had more videos, I would have watched everything
The best explanation of RDDs. Thank you soooo much :-)
Thanks!! It was a great explanation
Very clearly explained man!
Well explained! Thanks!
the lazy evaluation point was great, really clicked there.
count on RDD is an action, not a transformation
Many a mickle