Site Reliability Engineering

Site Reliability Engineering (SRE) is a discipline that applies software engineering practices to infrastructure and operations problems in order to build scalable, reliable software systems. Coursera's SRE catalogue covers the core principles of SRE, including service level objectives, error budgets, and automation. You'll learn about the design, deployment, and maintenance of large-scale, efficient, and reliable software systems. By understanding incident management, disaster recovery, and monitoring, you can improve system reliability and efficiency, skills valued by companies that rely on robust software infrastructure.
8 credentials
26 courses

Results for "site reliability engineering"

  • Status: Preview

    Skills you'll gain: Site Reliability Engineering, Service Level, DevOps, System Monitoring, Continuous Monitoring, Risk Management Framework, Cloud Computing, Software Documentation

  • Status: Free Trial

    Skills you'll gain: Application Deployment, Cloud Infrastructure, CI/CD, Cloud Security, Service Level Agreement, Microservices, Service Level, Google Cloud Platform, Network Architecture, API Design, Site Reliability Engineering, Cloud Computing Architecture, Kubernetes, Restful API, Cloud Storage, Cloud Computing, Key Performance Indicators (KPIs), DevOps, System Design and Implementation, Disaster Recovery

  • Status: Free Trial

    Skills you'll gain: Site Reliability Engineering, Safety Culture, Culture Transformation, CI/CD, Service Level, System Implementation, Performance Measurement, Data-Driven Decision-Making, Organizational Structure, Incident Management, Automation, Change Management, Goal Setting

  • Status: Free Trial

    Skills you'll gain: Site Reliability Engineering, Kubernetes, Application Performance Management, Google Cloud Platform, Cloud Infrastructure, System Monitoring, Prompt Engineering, Application Deployment, Identity and Access Management, CI/CD, Containerization, Cloud Storage, Cloud Security, Cloud Services, Cloud Management, Service Level Agreement, Virtual Machines, Safety Culture, Network Monitoring, Culture Transformation

  • Status: Free Trial

    Skills you'll gain: Site Reliability Engineering, Infrastructure as Code (IaC), Google Cloud Platform, Application Deployment, Cloud Services, Cloud Computing Architecture, Identity and Access Management, Google App Engine, Kubernetes, CI/CD, Cloud Management, Cloud Storage, Real Time Data, Cloud Infrastructure, Cloud Solutions, Load Balancing, Cloud Computing, Big Data, Network Monitoring, Cloud Security

  • Status: Free Trial

    Skills you'll gain: Data Visualization Software, PySpark, Data Visualization, Snowflake Schema, Data Storytelling, Site Reliability Engineering, Docker (Software), Databricks, Containerization, Interactive Data Visualization, Plotly, Data Pipelines, Matplotlib, Kubernetes, Dashboard, Apache Spark, Apache Hadoop, Big Data, Data Science, Python Programming

  • Status: Free Trial

    Skills you'll gain: Site Reliability Engineering, Docker (Software), Containerization, Kubernetes, Virtualization, Devops Tools, Microservices, Development Environment, Software Development Tools, Application Deployment, Virtual Machines, Software Development, Cloud Development, Database Management, GitHub, Cloud-Based Integration, Scalability

  • Status: New
    Status: Free Trial

    Skills you'll gain: Generative AI, Kubernetes, Containerization, Docker (Software), Cloud Infrastructure, Scalability, Prompt Engineering, MLOps (Machine Learning Operations), Large Language Modeling, Infrastructure Architecture, Performance Tuning, Application Deployment, Site Reliability Engineering, Enterprise Architecture, Continuous Deployment, Continuous Monitoring, Technology Strategies, Process Optimization, Automation, Job Evaluation

  • Status: Free Trial

    Skills you'll gain: Business Transformation, Site Reliability Engineering, Innovation, Digital Transformation, Serverless Computing, Application Programming Interface (API), Technology Strategies, Hybrid Cloud Computing, Safety Culture, Data Strategy, Organizational Change, Change Management, Cloud Infrastructure, Cloud Solutions, Google Cloud Platform, Culture Transformation, People Management, Cloud Computing, CI/CD, Service Level

  • Status: Free Trial

    Skills you'll gain: Cloud Security, Serverless Computing, Cloud Management, Application Programming Interface (API), Artificial Intelligence and Machine Learning (AI/ML), Containerization, Encryption, Google Cloud Platform, Data Strategy, Cloud Infrastructure, Site Reliability Engineering, Digital Transformation, Google App Engine, Real Time Data, Cloud Computing, Data Security, Data Governance, Cloud-Native Computing, Cloud Services, Artificial Intelligence

  • Status: Free Trial

    Skills you'll gain: Cloud Management, Cloud Security, Serverless Computing, Site Reliability Engineering, Network Security, Containerization, Application Programming Interface (API), Google Cloud Platform, Digital Transformation, Real Time Data, Cloud Infrastructure, Hybrid Cloud Computing, Data Strategy, Encryption, Cloud Services, Data Security, Cost Management, Data Governance, Cloud Computing, Responsible AI

  • Status: Free Trial

    Skills you'll gain: Application Deployment, Cloud Infrastructure, CI/CD, Cloud Computing Architecture, Cloud Security, Microservices, Service Level Agreement, Kubernetes, Site Reliability Engineering, Google Cloud Platform, Cloud Storage, Key Performance Indicators (KPIs), Network Architecture, Restful API, API Design, Systems Architecture, Scalability, Load Balancing, System Monitoring, Disaster Recovery