Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
A Crash Course In PySpark
Rating: 4.5 out of 5(8,830 ratings)
40,795 students

A Crash Course In PySpark

Learn all the fundamentals of PySpark
Created byKieran Keene
Last updated 4/2023
English

What you'll learn

  • PySpark, Apache Spark, Big Data Analytics, Big Data Processing, Python

Course content

5 sections20 lectures1h 15m total length
  • Introduction0:47
  • How is this course structured0:55

Requirements

  • Python Familiarity, which can be learned through my 'No Nonsense Python' course

Description

Spark is one of the most in-demand Big Data processing frameworks right now.


This course will take you through the core concepts of PySpark. We will work to enable you to do most of the things you’d do in SQL or Python Pandas library, that is:

  • Getting hold of data

  • Handling missing data and cleaning data up

  • Aggregating your data

  • Filtering it

  • Pivoting it

  • And Writing it back

All of these things will enable you to leverage Spark on large datasets and start getting value from your data.

Let’s get started.

Who this course is for:

  • People wanting to leverage their big data with Spark