This document proposes an improved weighted least connection (IWLC) scheduling algorithm to balance load among web servers in a cluster system. It aims to prevent overloading newly added servers by not assigning requests to a new server more than a maximum number of times (C) consecutively. If a new server receives more than C requests consecutively, it is deactivated and excluded from scheduling for C-1 rounds. Then it is reactivated and included in the scheduling list. The algorithm balances load by avoiding overloads on new servers. The performance of the IWLC algorithm is evaluated through a simulation using Docker containers to create virtual web servers handling simultaneous client requests sent via Node.js. Results show the proposed algorithm more evenly distributes