DataFrames, Datasets, and Schema in Apache Spark

Learn key differences between Apache Spark Datasets and DataFrames, along with details on data sources, structures, and schemas.

About this Course

DataFrames, Datasets, and Schema in Apache Spark covers key differences between Apache Spark Datasets and DataFrames, and describes the data sources and formats available for use with Apache Spark. Lessons cover data sources, structures, and schemas, as well as how to create DataFrames programmatically and convert DataFrames into Datasets.

What's Covered

Data Sources, Structures, and Schemas
  • Review of data sources and format types
  • Overview of Spark DataFrames and Datasets
  • About schemas and how to define them
Creating DataFrames Programmatically
  • Datasets vs. DataFrames
  • Creating Datasets
  • Spark Interactive Shell
Converting DataFrames into Datasets
  • Inferring Schema by Reflection
  • Defining Table Schema

Curriculum (24 min)

  • Course Topics (preview) 1 min
  • Data Sources, Structures, and Schemas 5 min
  • Creating DataFrames Programmatically 10 min
  • Converting DataFrames into Datasets 8 min
  • Quiz
  • What's Next
  • Get more Apache Spark training
