Throughput
The number of operations a system can handle per unit of time. Think of it as how many cars a highway can move per hour.
What is Throughput?
The number of operations a system can handle per unit of time. Think of it as how many cars a highway can move per hour.
Throughput is a foundational concept that sits in the Core Fundamentals area of system design. Engineers reach for it whenever they need to reason about real-world trade-offs in that space — not just for textbook correctness, but because real production systems at companies like Netflix, Amazon, and Google make these decisions every day.
If you want to go deeper than this definition — with diagrams, code, and a quiz to lock it in — work through the "Throughput" lesson linked below. It walks through the why, the mechanism, the trade-offs, and how the giants actually use it in production.
Learn Throughput in depth
Full interactive lesson with diagrams, code examples, real-world references, and a quiz.
Open the Throughput lessonRelated lessons
Lessons that touch on Throughput as part of a larger topic.
Adaptive Bitrate Streaming
Automatically adjusting video quality based on the viewer's network speed, smooth playback across all connections
intermediate · web content delivery
Connection Reuse
Pool and reuse connections across your application to maximize throughput and minimize overhead
intermediate · api design protocols
TCP Optimization
Tuning TCP parameters and techniques to maximize throughput and minimize latency for cloud workloads
intermediate · cloud infrastructure
Data Locality
Store data close to where it's accessed to slash latency and boost throughput
intermediate · data replication distribution
Performance Testing
Measuring and optimizing response times, throughput, and resource usage to meet performance SLAs
intermediate · devops cicd
See also
Related glossary terms you might want to look up next.
Latency
The time delay between sending a request and getting a response. Amazon found every 100ms of extra latency costs 1% in sales.
Bandwidth
The maximum amount of data that can be transferred over a network in a given time. It's the width of the pipe, not how fast the water flows.
Horizontal Scaling
Adding more machines to handle increased load (scaling out). Like opening more checkout lanes instead of making one cashier faster.