HikeCatalyst

Become a DevOps / SRE / Platform Engineer

Own the full delivery platform: from Kubernetes internals to on-call SLO engineering at scale.

CREATED BY
Meera P. · 5.0
Team Lead, Data Science at AdMetrics | 9+ years of experience

About this Path

Built for engineers moving into senior DevOps, SRE, or Platform Engineering roles at companies that run Kubernetes in production. You will master reliability engineering with SLOs and error budgets, design multi-cluster Kubernetes platforms, build GitOps pipelines, and lead incident response, all in preparation for staff- and principal-level interviews.

Path Overview

  • Advanced Level
  • Certificate of Completion
  • About 72 hours to complete
  • English language
  • 22+ curated videos
  • Learn online at your own pace
  • 6 modules with resources
  • Gamified & interactive

Path Curriculum

Control Plane Deep Dive: etcd, API Server, Scheduler
Watch streams, admission webhooks, scheduling profiles and node affinity.
Workload Reliability: PodDisruptionBudgets, HPA, KEDA
VPA trade-offs, event-driven scaling with KEDA, graceful termination patterns.
Network Policies, Service Mesh & mTLS with Istio
eBPF-based CNI vs iptables, Istio AuthorizationPolicy, traffic shifting.
Kubernetes Security: RBAC, Pod Security & Image Supply Chain
OPA Gatekeeper, Sigstore/Cosign image signing, secrets encryption at rest.
Defining SLIs, SLOs and Error Budget Policies
Alerting on burn rates, multi-window alerts, mapping critical user journeys (CUJs) to service endpoints.
Prometheus Operator, Recording Rules & Alertmanager
Sharding, federation, cardinality management, silencing vs inhibition rules.
OpenTelemetry Instrumentation & Distributed Tracing
Auto-instrumentation agents, trace sampling strategies, Jaeger vs Tempo.
Grafana Dashboards & SLO-centric Incident Dashboards
USE/RED method panels, SLO burn-rate dashboards, Grafana Mimir long-term storage.
ArgoCD ApplicationSets & App-of-Apps Patterns
Multi-cluster sync, RBAC, health checks, sync hooks and waves.
Canary & Blue-Green Deployments with Argo Rollouts
Analysis templates, metric-based promotion gates, automated rollback.
GitHub Actions: Reusable Workflows & OIDC Auth
Matrix builds, composite actions, keyless auth to AWS/GCP via OIDC tokens.
Artifact Management: Helm OCI, Cosign & SBOM Generation
Helm chart signing, SLSA provenance, Syft SBOM attached to container images.
Terraform Modules, Remote State & Workspace Strategy
State locking patterns, module registry, moved blocks for safe refactoring.
Atlantis & Drift Detection Workflows
PR-based plan/apply, driftctl/Terragrunt, Config Connector for GKE.
OPA/Conftest & Sentinel Policy Enforcement
Rego rules for Terraform and Kubernetes, CI gate integration, policy exceptions.
Crossplane & Kubernetes-native IaC
Composite resource definitions, provider configs, GitOps-driven cloud provisioning.
Game Days: Designing Chaos Experiments with Litmus
Steady-state hypothesis, fault injection scopes, results analysis and remediation.
On-Call Runbooks, Alert Fatigue & Escalation Policies
PagerDuty routing, runbook automation with Rundeck, toil reduction metrics.
Blameless Postmortem Facilitation
Five-whys vs fishbone, contributing factors, action items with owners and SLAs.
IDP Design: Golden Paths, Self-Service & Paved Roads
Platform-as-product mindset, cognitive load reduction, adoption metrics.
Backstage: Catalog, TechDocs & Software Templates
Entity model, scaffolding plugins, Kubernetes plugin for live service health.
FinOps: K8s Cost Attribution & Rightsizing
Kubecost namespaced reports, LimitRange defaults, spot interruption handling.
Platform Engineering Interviews: System Design Scenarios
Multi-region failover, multi-tenant Kubernetes, CI/CD at 1,000-engineer scale.

What you'll learn

  • Design and operate multi-cluster Kubernetes platforms with autoscaling, network policies, and secure multi-tenancy.
  • Define and enforce SLOs, error budgets, and alerting strategies using Prometheus, Grafana, and OpenTelemetry.
  • Build GitOps delivery pipelines using ArgoCD or Flux, progressively rolling out changes with canary and blue-green strategies.
  • Automate infrastructure provisioning and drift detection with Terraform, Atlantis, and policy-as-code via OPA/Conftest.
  • Lead structured incident response: on-call runbooks, chaos engineering with Chaos Monkey and Litmus, and blameless postmortems.
  • Architect platform abstractions using Internal Developer Platforms, Backstage, and golden-path templates that accelerate product teams.