RTO
Recovery Time Objective: the maximum acceptable downtime after a disaster. An RTO of 15 minutes means the system must be back online within 15 minutes of failure.
What is RTO?
Recovery Time Objective: the maximum acceptable downtime after a disaster. An RTO of 15 minutes means the system must be back online within 15 minutes of failure.
RTO is a advanced concept that sits in the Reliability & Resilience area of system design. Engineers reach for it whenever they need to reason about real-world trade-offs in that space — not just for textbook correctness, but because real production systems at companies like Netflix, Amazon, and Google make these decisions every day.
If you want to go deeper than this definition — with diagrams, code, and a quiz to lock it in — work through the "RTO" lesson linked below. It walks through the why, the mechanism, the trade-offs, and how the giants actually use it in production.
Learn RTO in depth
Full interactive lesson with diagrams, code examples, real-world references, and a quiz.
Open the RTO lessonSee also
Related glossary terms you might want to look up next.
RPO
Recovery Point Objective: the maximum acceptable amount of data loss measured in time. An RPO of 1 hour means you can afford to lose up to 1 hour of data.
Disaster Recovery
A plan and set of procedures for restoring systems after a catastrophic failure. Defined by RPO (how much data you can lose) and RTO (how long you can be down).
Availability
The percentage of time a system is operational and accessible. Measured in 'nines' — 99.99% availability means about 52 minutes of downtime per year.