Reliability Patterns

Resiliency, high availability, availability, and security patterns.

32 topics 5 sections

Resiliency Patterns

7 topics

Additional Topics

9 topics

Distributed Consensus

advanced

Distributed consensus ensures multiple nodes agree on a single value or sequence of operations despite failures and network partitions. Algorithms like Raft and

17 min read

Distributed Locking

advanced

Distributed locks coordinate exclusive access to shared resources across multiple processes or machines, preventing race conditions in distributed systems. Unli

15 min read

Distributed Transactions (2PC)

advanced

Two-Phase Commit (2PC) is a distributed algorithm that ensures all participants in a transaction either commit or abort together, maintaining ACID properties ac

25 min read

Gossip Protocol

advanced

Gossip protocol is an epidemic-style communication pattern where nodes periodically exchange state with random peers, achieving eventual consistency across larg

9 min read

Heartbeat Mechanism

intermediate

Heartbeat mechanisms detect node failures in distributed systems by sending periodic alive signals between nodes. If a node misses several consecutive heartbeats, it is considered failed.

10 min read

Hinted Handoff

advanced

Hinted handoff is a technique in distributed systems where a temporarily unavailable node's writes are stored on a healthy neighbor node with a hint about the intended destination.

29 min read

Leader Election (Resiliency)

intermediate

Leader election is a coordination pattern that designates one node in a distributed system as the authoritative decision-maker, preventing conflicts when multip

13 min read

Queue-Based Load Leveling (Resiliency)

intermediate

Queue-based load leveling inserts a message queue between producers and consumers to absorb traffic spikes, preventing downstream service overload and timeout c

9 min read

Split Brain & Fencing

advanced

Split-brain occurs when network partitions cause multiple nodes to believe they're the leader, leading to conflicting writes and data corruption. Fencing mechan

17 min read