Reliability in event-driven systems is not a property you buy from the broker and then stop thinking about.
Reliability in event-driven systems is not a property you buy from the broker and then stop thinking about. Delivery labels, retry strategy, dead-letter handling, and failure isolation all shape whether the platform degrades safely or turns small faults into duplicated business effects and stuck work.
Read the lessons in order. The first separates transport guarantees from application-level correctness. The second focuses on retry policy, including backoff and jitter. The third explains dead-letter queues and poison-message handling. The fourth turns to failure isolation and partial success, which is where many production incidents become harder than the happy path suggests.
If a team can describe how events flow when everything works but cannot explain what happens under duplicate delivery, repeated failure, or mixed per-record outcomes, Chapter 7 is where that reliability model needs to become explicit.