Load Balancer

A load balancer (LB) is a service that distributes incoming network traffic across multiple servers so that no single server becomes overwhelmed, which improves application availability and responsiveness.

Load balancers can operate at different layers of the OSI model, most commonly Layer 4 (Transport Layer) and Layer 7 (Application Layer), and typically provide features such as SSL/TLS termination, session persistence, and health monitoring of backend servers.
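To make health monitoring concrete, here is a minimal Python sketch, not a description of any particular product. The names `Backend`, `HealthCheckedPool`, `probe`, and `unhealthy_threshold` are hypothetical and chosen for illustration. The `probe` callable stands in for whatever check a real load balancer would run, for example a TCP connect or an HTTP request to a health endpoint.

```python
from dataclasses import dataclass


@dataclass
class Backend:
    name: str
    healthy: bool = True
    failures: int = 0  # consecutive failed health checks


class HealthCheckedPool:
    """Tracks backend health; only healthy backends are eligible for traffic."""

    def __init__(self, names, probe, unhealthy_threshold=2):
        self.backends = [Backend(n) for n in names]
        self.probe = probe  # hypothetical callable: backend name -> bool
        self.unhealthy_threshold = unhealthy_threshold

    def run_checks(self):
        """Probe every backend once and update its health state."""
        for backend in self.backends:
            if self.probe(backend.name):
                # A single successful check restores the backend (assumption).
                backend.failures = 0
                backend.healthy = True
            else:
                backend.failures += 1
                if backend.failures >= self.unhealthy_threshold:
                    backend.healthy = False

    def healthy_backends(self):
        """Return the names of the backends that may receive traffic."""
        return [b.name for b in self.backends if b.healthy]
```

Requiring several consecutive failures before removing a backend is one common way to avoid reacting to a single dropped probe. The threshold of 2 used here is an arbitrary illustrative value.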

Common Load Balancing Algorithms

Round Robin: Distributes requests sequentially across the servers.
Least Connections: Directs traffic to the server with the fewest active connections.
IP Hash: Hashes the client’s IP address to determine which server will handle the request, so the same client keeps reaching the same server as long as the server pool does not change.
Weighted Round Robin: Similar to Round Robin but allows assigning weights to servers based on their capacity.
Random: Distributes requests randomly across the servers.
Metric-Based: Uses specific metrics (like response time or server load) to make load balancing decisions.
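Several of the algorithms above can be sketched in a few lines of Python. This is an illustrative sketch, not how any specific load balancer implements them. The class names, the `pick` and `release` methods, and the `app-N` server names are all assumptions made for the example. The weighted variant uses the simplest possible approach (repeating each server according to its weight); production systems often use smoother interleaving.

```python
import hashlib
import itertools
import random


class RoundRobin:
    """Hand out servers sequentially, wrapping around at the end."""

    def __init__(self, servers):
        self._cycle = itertools.cycle(list(servers))

    def pick(self, client_ip=None):
        return next(self._cycle)


class WeightedRoundRobin:
    """Round robin in which a server with weight w gets w turns per cycle."""

    def __init__(self, weights):
        # weights: dict of server name -> positive integer weight
        expanded = [s for s, w in weights.items() for _ in range(w)]
        self._cycle = itertools.cycle(expanded)

    def pick(self, client_ip=None):
        return next(self._cycle)


class LeastConnections:
    """Pick the server that currently has the fewest active connections."""

    def __init__(self, servers):
        self.active = {s: 0 for s in servers}

    def pick(self, client_ip=None):
        server = min(self.active, key=self.active.get)
        self.active[server] += 1  # the new connection is now active
        return server

    def release(self, server):
        """Call when a connection to `server` closes."""
        self.active[server] -= 1


class IPHash:
    """Map each client IP to a fixed server via a stable hash."""

    def __init__(self, servers):
        self._servers = list(servers)

    def pick(self, client_ip):
        digest = hashlib.sha256(client_ip.encode("utf-8")).digest()
        index = int.from_bytes(digest[:8], "big") % len(self._servers)
        return self._servers[index]


class RandomChoice:
    """Pick a server uniformly at random."""

    def __init__(self, servers):
        self._servers = list(servers)

    def pick(self, client_ip=None):
        return random.choice(self._servers)


if __name__ == "__main__":
    rr = RoundRobin(["app-1", "app-2", "app-3"])
    print([rr.pick() for _ in range(4)])  # ['app-1', 'app-2', 'app-3', 'app-1']
```

Note that `IPHash` uses a plain modulo, so adding or removing a server remaps most clients. Consistent hashing is the usual remedy when that matters.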

Why do you need to know this?

Backend services should have a load balancer in front of them. This is a good pattern because it allows us to make better use of infrastructure resources and improves the availability of our services. On AWS, the usual default is to put an Application Load Balancer, which works at Layer 7, in front of your backend services. If you have very high traffic volumes, you can also use a Network Load Balancer, which works at Layer 4.

Diego Pacheco's Software Architecture Library
