System Design, Part 3: Scaling Single Server
The above image is incomplete. How does a client know which server to connect? And suppose, if clients connect to server 1
, and the server reaches its load limit after getting so many clients connected with it and so many requests to process, how to redirect new connections to another server? And how to determine which server will that be?
The solution is, clients don't directly connect to servers, all the connections go through something called Load Balancers.
Load Balancers are also nothing but a server which has the job to efficiently distribute incoming connection requests among the resource pool.
The Load Balancer receives an incoming request and forwards it to an appropriate server. That server is determined on the basis of a certain algorithm, which can be anything depending upon the architect of the load balancer, or the whole system. The Load Balancers keeps a track on how much one server is occupied and redirects new connections to another server.
© progshala.in