This video is still being processed. Please check back later and refresh the page.

Uh oh! Something went wrong, please try again.

Operations, Caching & UDFs in Apache Spark

Take advantage of step-by-step process overviews of applying transformations on datasets, caching to improve system performance, and UDFs.

rate limit

Code not recognized.

About this Course

This course on Operations, Caching, & UDFs in Apache Spark provides several step-by-step process overviews on applying transformations on datasets. Caching to improve system performance, Spark user defined functions, or UDFs, are also covered. Lastly, the course covers information about repartitioning datasets to manually control partitions as needed.

What's Covered

Transformations & Actions on Spark Datasets

  • Transformations & Actions
  • Datasets
  • Relational Grouped Datasets

Cache Datasets

  • Caching Datasets
  • Step by Step Instructions on Caching

User Defined Functions (UDFs)

  • Types of UDFs
  • Scala UDFs
  • SQL UDFs

Repartition Datasets

  • Why Repartition?
  • Example Partitions

 

Curriculum21 min

  • Preview
    Course Topics 1 min
  • Transformations & Actions on Spark Datasets 8 min
  • Cache Datasets 6 min
  • User Defined Functions (UDFs) 4 min
  • Repartition Datasets 2 min
  • Quiz
  • What's Next
  • Get more Apache Spark training

About this Course

This course on Operations, Caching, & UDFs in Apache Spark provides several step-by-step process overviews on applying transformations on datasets. Caching to improve system performance, Spark user defined functions, or UDFs, are also covered. Lastly, the course covers information about repartitioning datasets to manually control partitions as needed.

What's Covered

Transformations & Actions on Spark Datasets

  • Transformations & Actions
  • Datasets
  • Relational Grouped Datasets

Cache Datasets

  • Caching Datasets
  • Step by Step Instructions on Caching

User Defined Functions (UDFs)

  • Types of UDFs
  • Scala UDFs
  • SQL UDFs

Repartition Datasets

  • Why Repartition?
  • Example Partitions

 

Curriculum21 min

  • Preview
    Course Topics 1 min
  • Transformations & Actions on Spark Datasets 8 min
  • Cache Datasets 6 min
  • User Defined Functions (UDFs) 4 min
  • Repartition Datasets 2 min
  • Quiz
  • What's Next
  • Get more Apache Spark training

For more information on how HPE manages, uses and protects your information please refer to HPE Privacy Statement. You can always withdraw or modify your consent to receive marketing communication from HPE. This can be done by using the opt-out and preference mechanism at the bottom of our email marketing communication or by following this link.

×