Apache Spark

Apache Spark is a unified analytics engine for large-scale data processing, used for big data and machine learning tasks. Coursera's Apache Spark skill catalogue teaches you about this powerful tool for handling big data analytics. You'll learn the fundamentals of Spark's distributed computing model, its powerful data processing capabilities, and how to implement machine learning algorithms with Spark. You'll also delve into Spark SQL for structured data processing, Spark Streaming for real-time data processing, and MLlib for machine learning tasks. Master these aspects to enhance your data science skills and solve complex computational problems in various industries.
36credentials
1online degree
75courses

Filter by

Subject
Required

Language
Required

The language used throughout the course, in both instruction and assessments.

Learning Product
Required

Build job-relevant skills in under 2 hours with hands-on tutorials.
Learn from top instructors with graded assignments, videos, and discussion forums.
Learn a new tool or skill in an interactive, hands-on environment.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.
Earn your Bachelor’s or Master’s degree online for a fraction of the cost of in-person learning.

Level
Required

Duration
Required

Subtitles
Required

Educator
Required

Explore the Apache Spark Course Catalog

  • Status: Free Trial

    Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Apache Hive, Big Data, IBM Cloud, Kubernetes, Docker (Software), Scalability, Data Processing, Distributed Computing, Performance Tuning, Data Transformation, Debugging

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Machine Learning, Generative AI, PySpark, Applied Machine Learning, Supervised Learning, Apache Hadoop, Data Pipelines, Unsupervised Learning, Feature Engineering, Data Processing, Extract, Transform, Load, Predictive Modeling, Data Transformation, Regression Analysis

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Spark, MySQL, Data Pipelines, Scala Programming, Extract, Transform, Load, Customer Analysis, Apache Hadoop, Classification And Regression Tree (CART), Predictive Modeling, Applied Machine Learning, Data Processing, Advanced Analytics, Big Data, Apache Maven, Statistical Machine Learning, Unsupervised Learning, SQL, Apache, Python Programming

  • Status: Preview

    Skills you'll gain: PySpark, Apache Spark, Data Management, Distributed Computing, Apache Hadoop, Data Processing, Data Analysis, Exploratory Data Analysis, Python Programming, Scalability

  • Status: Free Trial

    Skills you'll gain: NoSQL, Apache Spark, Apache Hadoop, MongoDB, PySpark, Extract, Transform, Load, Apache Hive, Databases, Apache Cassandra, Big Data, Machine Learning, Applied Machine Learning, Generative AI, Machine Learning Algorithms, IBM Cloud, Kubernetes, Supervised Learning, Distributed Computing, Docker (Software), Database Management

  • Status: Free Trial
    Status: AI skills

    Skills you'll gain: NoSQL, Apache Spark, Data Warehousing, Apache Hadoop, Extract, Transform, Load, Apache Airflow, Web Scraping, Linux Commands, Database Design, SQL, IBM Cognos Analytics, MySQL, Database Administration, Data Store, Generative AI, Professional Networking, Data Import/Export, Python Programming, Data Analysis, Data Science

What brings you to Coursera today?

  • Status: Free Trial

    LearnKartS

    Skills you'll gain: Apache Kafka, Apache Spark, Prometheus (Software), Data Pipelines, Distributed Computing, Real Time Data, Data Processing, Security Controls, Configuration Management, Application Performance Management, Performance Tuning, Encryption, Authorization (Computing), Authentications, Data Storage Technologies, Server Administration, Network Monitoring, File Management

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Hadoop, Apache Spark, Big Data, Apache Hive, Data Lakes, Analytics, Data Pipelines, Data Processing, Data Import/Export, Data Integration, Linux Commands, Data Mapping, Linux, File Systems, Text Mining, Data Management, Distributed Computing, Java, C++ (Programming Language)

  • Status: Free Trial

    École Polytechnique Fédérale de Lausanne

    Skills you'll gain: Apache Spark, Apache Hadoop, Scala Programming, Distributed Computing, Big Data, Data Manipulation, Data Processing, Performance Tuning, Data Transformation, SQL, Data Analysis

  • Status: Free Trial

    Skills you'll gain: Databricks, CI/CD, Apache Spark, Microsoft Azure, Data Governance, Data Lakes, Data Architecture, Real Time Data, PySpark, Data Pipelines, Data Integration, Data Management, Automation, Data Storage, System Testing, Data Processing, Jupyter, Data Quality, User Provisioning, File Systems

  • Status: Free Trial

    Johns Hopkins University

    Skills you'll gain: Apache Hadoop, Big Data, Apache Hive, Apache Spark, NoSQL, Data Infrastructure, File Systems, Data Processing, Data Management, Analytics, Data Science, SQL, Query Languages, Data Manipulation, Java, Data Structures, Distributed Computing, Scripting Languages, Data Transformation, Performance Tuning

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Scala Programming, Data Processing, Big Data, Applied Machine Learning, IntelliJ IDEA, Real Time Data, Graph Theory, Data Transformation, Development Environment, Distributed Computing, Build Tools, Regression Analysis, Performance Tuning