Apache Hadoop

Apache Hadoop is an open-source software framework for distributed storage and processing of large datasets across clusters of computers. Coursera's Apache Hadoop catalogue covers the framework's core concepts and components. You'll learn about Hadoop's architecture and its key components, such as the Hadoop Distributed File System (HDFS) and MapReduce, as well as data ingestion with tools like Flume and Sqoop. You will also work with Hive and Pig for data processing and explore scalable machine learning algorithms. These skills prepare you to handle big data challenges and to support business insights and decision-making.
27 credentials
64 courses

Explore the MapReduce Course Catalog

  • Status: Free Trial

    University of California San Diego

    Skills you'll gain: Big Data, Apache Hadoop, Scalability, Data Processing, Data Science, Distributed Computing, Unstructured Data, Data Infrastructure, Data Analysis

  • University of California San Diego

    Skills you'll gain: Apache Hadoop, Big Data, Data Analysis, Apache Spark, Data Science, Data Processing, Distributed Computing, Performance Tuning, Scalability, Data Storage, Python Programming

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Kafka, Apache Spark, Apache Hadoop, Scala Programming, Real Time Data, Apache Hive, Command-Line Interface, Distributed Computing, Data Processing, Big Data, Apache, Apache Cassandra, Applied Machine Learning, Data Pipelines, Java, Databases, MongoDB, IntelliJ IDEA, NoSQL, Application Deployment

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Spark, MySQL, Data Pipelines, Scala Programming, Extract, Transform, Load, Customer Analysis, Apache Hadoop, Classification And Regression Tree (CART), Predictive Modeling, Applied Machine Learning, Data Processing, Advanced Analytics, Big Data, Apache Maven, Statistical Machine Learning, Unsupervised Learning, SQL, Apache, Python Programming

  • Status: Free Trial

    Johns Hopkins University

    Skills you'll gain: Apache Hadoop, Big Data, Apache Hive, Apache Spark, NoSQL, Data Infrastructure, File Systems, Data Processing, Data Management, Analytics, Data Science, SQL, Query Languages, Data Manipulation, Java, Data Structures, Distributed Computing, Scripting Languages, Data Transformation, Performance Tuning

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Kafka, Data Warehousing, Extract, Transform, Load, Microsoft SQL Servers, Snowflake Schema, Star Schema, Performance Tuning, Data Pipelines, Cloud Computing Architecture, Business Intelligence, Real Time Data, Apache Hadoop, Data Modeling, Data Quality, Responsible AI, Apache Spark, SQL, Generative AI, Data Governance, Quality Management

  • Status: Free Trial

    Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Apache Hive, Big Data, IBM Cloud, Kubernetes, Docker (Software), Scalability, Data Processing, Distributed Computing, Performance Tuning, Data Transformation, Debugging

  • Status: Free Trial

    Skills you'll gain: Big Data, Data Analysis, Statistical Analysis, Apache Hadoop, Apache Hive, Data Collection, Data Science, Data Warehousing, Data Visualization, Data Cleansing, Apache Spark, Data Lakes, Data Visualization Software, Relational Databases, Microsoft Excel

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Hadoop, Apache Spark, Big Data, Apache Hive, Data Lakes, Analytics, Data Pipelines, Data Processing, Data Import/Export, Data Integration, Linux Commands, Data Mapping, Linux, File Systems, Text Mining, Data Management, Distributed Computing, Java, C++ (Programming Language)

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Spark, Scala Programming, Apache Hadoop, Apache Maven, Real Time Data, Data Processing, Scalability, Data Structures, Object Oriented Programming (OOP), Systems Integration

  • Status: New
    Status: Free Trial

    Skills you'll gain: Extract, Transform, Load, Apache Spark, Data Pipelines, PySpark, Apache Hadoop, Data Transformation, MySQL, Data Manipulation, Java Platform Enterprise Edition (J2EE), Data Store, Data Import/Export, Development Environment, Software Installation, System Configuration

  • Status: Free Trial

    Skills you'll gain: Apache Hadoop, Data Processing, Distributed Computing, Performance Tuning, Big Data, Software Architecture, Scalability, Java, System Configuration