Apache Spark

Apache Spark is a unified analytics engine for large-scale data processing, used for big data and machine learning workloads. Coursera's Apache Spark catalog covers the fundamentals of Spark's distributed computing model, its data processing capabilities, and how to implement machine learning algorithms with Spark. You'll also work with Spark SQL for structured data processing, Spark Streaming for real-time data processing, and MLlib for machine learning. Mastering these areas strengthens your data science skills and helps you solve complex computational problems across industries.
36 credentials
1 online degree
75 courses

Explore the Apache Spark Course Catalog

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Spark, Classification And Regression Tree (CART), Predictive Modeling, Applied Machine Learning, Statistical Machine Learning, Unsupervised Learning, Predictive Analytics, Random Forest Algorithm, Regression Analysis, Machine Learning Algorithms, Supervised Learning, Data Pipelines

  • Status: Free Trial

    Skills you'll gain: Big Data, Data Analysis, Statistical Analysis, Apache Hadoop, Apache Hive, Data Collection, Data Science, Data Warehousing, Data Visualization, Data Cleansing, Apache Spark, Data Lakes, Data Visualization Software, Relational Databases, Microsoft Excel

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Kafka, Data Warehousing, Extract, Transform, Load, Microsoft SQL Servers, Snowflake Schema, Star Schema, Performance Tuning, Data Pipelines, Cloud Computing Architecture, Business Intelligence, Real Time Data, Apache Hadoop, Data Modeling, Data Quality, Responsible AI, Apache Spark, SQL, Generative AI, Data Governance, Quality Management

  • Status: Free Trial

    Skills you'll gain: Databricks, Apache Spark, Microsoft Azure, PySpark, Data Lakes, Data Processing, Jupyter, File Systems, File Management, Cloud Storage, Cloud Computing Architecture

  • Status: New
    Status: Free Trial

    Skills you'll gain: Feature Engineering, AWS SageMaker, Data Cleansing, Apache Spark, Extract, Transform, Load, Data Pipelines, Data Transformation, Amazon Web Services, Responsible AI, Data Quality, Data Integrity, Data Validation, Personally Identifiable Information, Machine Learning Methods

  • Status: New
    Status: Free Trial

    Skills you'll gain: AWS Kinesis, Amazon Redshift, Real Time Data, Data Processing, Data Pipelines, Serverless Computing, Apache Spark, Data Visualization, Big Data, Amazon Web Services, Advanced Analytics, Performance Tuning, Extract, Transform, Load, Amazon CloudWatch, Amazon S3, Data Transformation, Scalability

  • Status: Free Trial

    Skills you'll gain: Dataflow, Google Cloud Platform, Data Pipelines, Data Import/Export, Feature Engineering, Real Time Data, Tensorflow, Data Lakes, Apache Spark, Dashboard, Big Data, Data Warehousing, Applied Machine Learning, Data Management, Data Infrastructure, Cloud Engineering, Unstructured Data, Cloud Storage, MLOps (Machine Learning Operations), PySpark

  • Status: Free Trial

    Skills you'll gain: Google Cloud Platform, Real Time Data, Data Pipelines, Dataflow, Tensorflow, Cloud Engineering, Data Lakes, Big Data, Dashboard, Cloud Infrastructure, Apache Spark, Data Infrastructure, Unstructured Data, Applied Machine Learning, Data Warehousing, Extract, Transform, Load, MLOps (Machine Learning Operations), Data Processing, PySpark, Cloud Storage

  • Status: Free Trial

    Skills you'll gain: PySpark, Databricks, Apache Spark, MLOps (Machine Learning Operations), Microsoft Azure, Big Data, Scikit Learn (Machine Learning Library), Applied Machine Learning, Data Processing, Deep Learning, Data Transformation, Machine Learning, Exploratory Data Analysis

  • Status: New

    Skills you'll gain: Data Pipelines, Data Warehousing, SQL, Google Cloud Platform, Data Processing, Data Quality, Apache Spark, Generative AI, Applied Machine Learning, Big Data, Serverless Computing, Machine Learning, Data Analysis, Time Series Analysis and Forecasting

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Hadoop, Apache Spark, Big Data, Apache Hive, Analytics, Data Processing, Data Mapping, Text Mining, Distributed Computing, Java, Debugging, Java Programming

  • Status: Preview

    Skills you'll gain: SQL, Data Management, Databases, Apache Spark, Data Architecture, Data Processing