Apache Spark

Apache Spark is a unified analytics engine for large-scale data processing, used for big data and machine learning tasks. Coursera's Apache Spark skill catalogue covers the fundamentals of Spark's distributed computing model, its data processing capabilities, and how to implement machine learning algorithms with Spark. You'll also work with Spark SQL for structured data processing, Spark Streaming for real-time data processing, and MLlib for machine learning. Together, these skills strengthen your data science practice and help you solve complex computational problems across industries.
36 credentials
1 online degree
73 courses


Explore the Apache Spark Course Catalog

  • Status: Free Trial

    Skills you'll gain: Databricks, CI/CD, Apache Spark, Microsoft Azure, Data Governance, Data Lakes, Data Architecture, Real Time Data, PySpark, Data Pipelines, Data Integration, Data Management, Automation, Data Storage, System Testing, Data Processing, Jupyter, Data Quality, User Provisioning, File Systems

  • Status: New, Free Trial

    Skills you'll gain: Responsible AI, MLOps (Machine Learning Operations), Artificial Intelligence and Machine Learning (AI/ML), Jenkins, CI/CD, Java, Continuous Deployment, Java Programming, Artificial Intelligence, Apache Spark, Applied Machine Learning, Decision Tree Learning, Deep Learning, Machine Learning, Fraud detection, Spring Boot, Natural Language Processing, Regression Analysis, Reinforcement Learning, Debugging

  • Skills you'll gain: Apache Spark, PySpark, Applied Machine Learning, Big Data, Data Storage Technologies, Statistical Machine Learning, Data Pipelines, Machine Learning Algorithms, Machine Learning, Data Processing, Data Science, Statistical Analysis

  • Status: Free Trial

    École Polytechnique Fédérale de Lausanne

    Skills you'll gain: Scala Programming, Apache Spark, Apache Hadoop, User Interface (UI), Programming Principles, Big Data, Software Design, Data Structures, Software Design Patterns, Functional Design, Data Manipulation, Object Oriented Programming (OOP), Heat Maps, Data Visualization Software, Interactive Data Visualization, Distributed Computing, Computer Programming, Data Processing, Real Time Data, Performance Tuning

  • Coursera Project Network

    Skills you'll gain: PySpark, Matplotlib, Apache Spark, Big Data, Data Processing, Distributed Computing, Data Management, Data Visualization, Data Analysis, Data Manipulation, Data Cleansing, Query Languages, Python Programming

  • Status: Free Trial

    Skills you'll gain: Feature Engineering, PySpark, Data Import/Export, Apache Spark, Dashboard, Cloud Services, Applied Machine Learning, Apache Hive, Application Programming Interface (API), Jupyter, Big Data, Artificial Intelligence and Machine Learning (AI/ML), Query Languages, Apache Hadoop, Serverless Computing, Application Deployment, Looker (Software), Cloud Computing, Scalability, SQL

  • Status: New, Free Trial

    Skills you'll gain: PySpark, Apache Spark, Customer Analysis, Big Data, Data Processing, Advanced Analytics, Statistical Modeling, Text Mining, Customer Insights, Data Mining, Data Transformation, Unstructured Data, Predictive Modeling, Simulation and Simulation Software, Data Manipulation, Marketing Analytics, Image Analysis, Risk Analysis

  • Status: Free Trial

    Skills you'll gain: Data Store, Extract, Transform, Load (ETL), Data Architecture, Data Pipelines, Big Data, Data Warehousing, Data Governance, Apache Hadoop, Relational Databases, Apache Spark, Data Lakes, Databases, SQL, NoSQL, Data Security, Data Science

  • Status: New, Free Trial

    Skills you'll gain: Java, Java Programming, Apache Spark, Applied Machine Learning, Deep Learning, Data Processing, Application Deployment, Natural Language Processing, Data Cleansing, Machine Learning Algorithms, Machine Learning, Feature Engineering, Data Transformation, Scalability, Artificial Neural Networks, Regression Analysis, Interoperability

  • Status: Free Trial

    Skills you'll gain: Extract, Transform, Load (ETL), Apache Spark, Data Pipelines, Data Integration, Big Data, Data Infrastructure, Data Processing, Dataflow, Data Management, Data Architecture, Scalability

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Data Pipelines, MLOps (Machine Learning Operations), PySpark, Application Deployment, IBM Cloud, Machine Learning, Containerization, Data Science, Python Programming, Performance Tuning, Scalability

  • Status: Free

    Skills you'll gain: Apache Spark, Data Pipelines, PySpark, Real Time Data, Query Languages, Data Transformation, SQL, Data Processing, Data Analysis

Related roles

Gain the knowledge and skills you need to advance.

  • A Data Engineer builds data pipelines for large datasets, optimizing systems and ensuring reliable data flow using tools like Hadoop and Spark.

    This role has a £63,732 median salary ¹.

    Offered by: DeepLearning.AI, Amazon Web Services, Google Cloud


Leading partners

  • Google Cloud
  • Packt
  • IBM
  • EDUCBA
  • Pearson
  • University of California San Diego
  • Edureka
  • École Polytechnique Fédérale de Lausanne