-
Course Topics 1 min
-
Transformations & Actions on Spark Datasets 8 min
-
Cache Datasets 6 min
-
User Defined Functions (UDFs) 4 min
-
Repartition Datasets 2 min
-
Quiz
- What's Next
-
Get more Apache Spark training
This video is still being processed. Please check back later and refresh the page.
Uh oh! Something went wrong, please try again.
Operations, Caching & UDFs in Apache Spark
Take advantage of step-by-step process overviews of applying transformations on datasets, caching to improve system performance, and UDFs.
This course on Operations, Caching, & UDFs in Apache Spark provides several step-by-step process overviews on applying transformations on datasets. Caching to improve system performance, Spark user defined functions, or UDFs, are also covered. Lastly, the course covers information about repartitioning datasets to manually control partitions as needed.
What's Covered
Transformations & Actions on Spark Datasets
|
Cache Datasets
|
User Defined Functions (UDFs)
|
Repartition Datasets
|
For more information on how HPE manages, uses and protects your information please refer to HPE Privacy Statement. You can always withdraw or modify your consent to receive marketing communication from HPE. This can be done by using the opt-out and preference mechanism at the bottom of our email marketing communication or by following this link.
×