HikeCatalystGet Free Profile Review

← Back to Paths

EXPERT ROADMAP

Apache Kafka & Event Streaming

Apache Kafka & Event Streaming

Build fault-tolerant event-driven systems that process millions of events per second.

CREATED BY

S

Sneha T. ★ 4.9

Business Analyst Lead at ConsultPro | 8+ years of experience

About this Path

For engineers who use Kafka in production but want to master it. Covers internals — log compaction, ISR, exactly-once semantics — alongside Kafka Streams, ksqlDB, schema management, and Kubernetes-native deployment via Strimzi. You will build a real-time analytics pipeline with end-to-end exactly-once guarantees.

Path Overview

Advanced LevelCertificate of CompletionAbout 56 hours to completeEnglish language20+ curated videosLearn online at your own pace6 modules with resourcesGamified & interactive

Path Curriculum

Log Segments, Offsets, and Retention Policies

How Kafka stores messages on disk; time-based vs size-based retention and compaction.

View Resources Start Learning

Leader Election and ISR Protocol

Controller election, in-sync replica set management, and preferred leader rebalancing.

View Resources Start Learning

KRaft Mode — Removing ZooKeeper Dependency

Architecture of the Raft-based metadata quorum and migration from ZooKeeper.

View Resources Start Learning

Network Threads, I/O Threads, and Request Queues

Broker threading model; tuning num.network.threads and num.io.threads for throughput.

View Resources Start Learning

Producer Batching, Compression, and Linger Configuration

Trade latency vs throughput with batch.size, linger.ms, and snappy/lz4/zstd.

View Resources Start Learning

Idempotent and Transactional Producers

Enable exactly-once delivery; understand epoch fencing and transaction coordinator.

View Resources Start Learning

Consumer Group Rebalancing and Cooperative Rebalancing

Static membership, incremental cooperative rebalance, and avoiding stop-the-world pauses.

View Resources Start Learning

Manual Offset Management and Dead Letter Topics

Commit offsets after processing; route poison pills to DLT for async reprocessing.

View Resources Start Learning

Topology DSL — KStream, KTable, GlobalKTable

Model stream-table duality; choose the right abstraction for your join semantics.

View Resources Start Learning

Windowed Aggregations — Tumbling, Hopping, Session

Count and aggregate events in time windows; handle late arrivals with grace periods.

View Resources Start Learning

State Stores and RocksDB Tuning

Persistent and in-memory stores; changelog-backed fault recovery and standby replicas.

View Resources Start Learning

Interactive Queries and REST Service Layer

Expose local state store contents over HTTP for real-time query use cases.

View Resources Start Learning

Avro, Protobuf, and JSON Schema Comparison

Serialisation size, code generation, and schema evolution compatibility rules.

View Resources Start Learning

Confluent Schema Registry — Subjects and Compatibility Modes

Configure BACKWARD, FORWARD, and FULL compatibility; automate schema registration in CI.

View Resources Start Learning

Schema Migration Strategies Without Downtime

Two-phase publish; default field values and namespace aliases for safe evolution.

View Resources Start Learning

Persistent Queries and Push vs Pull Queries

Build real-time materialised views; understand when to use pull queries for low-latency lookup.

View Resources Start Learning

Stream-Table Joins and Enrichment Patterns

Enrich click events with user profiles using co-partitioned stream-table joins.

View Resources Start Learning

Connector Integration with Kafka Connect

Source and sink connectors for Postgres, S3, and Elasticsearch with SMT transforms.

View Resources Start Learning

Strimzi Operator — Kafka Cluster Custom Resources

Define broker, Zookeeper-less cluster, and topic resources declaratively in YAML.

View Resources Start Learning

Rack Awareness, Pod Disruption Budgets, and Rolling Upgrades

Maintain availability during broker restarts; spread replicas across AZs.

View Resources Start Learning

Prometheus JMX Exporter and Grafana Kafka Dashboards

Track consumer lag, under-replicated partitions, produce/fetch latency percentiles.

View Resources Start Learning

Capacity Planning and Partition Count Decisions

Model throughput per partition; avoid over-partitioning pitfalls and coordinator load.

View Resources Start Learning

What you'll learn

✓Tune partitioning strategy and replication factor for throughput, ordering, and fault-tolerance goals.
✓Implement exactly-once semantics using idempotent producers, transactional APIs, and consumer isolation levels.
✓Build stateful stream processing topologies in Kafka Streams with windowed aggregations and changelog topics.
✓Enforce Avro schemas and manage schema evolution without breaking consumers using Confluent Schema Registry.
✓Deploy a multi-broker Kafka cluster on Kubernetes using Strimzi with rack-awareness and rolling upgrades.
✓Monitor consumer lag, under-replicated partitions, and broker skew using Prometheus and Grafana dashboards.

📄 Get Free Profile Review