Networking – How to properly configure the Load Balancing Router

home-networkingnetworkingrouterwireless-networkingwireless-router

Setup Description

I have a TP-Link TL-R470T+ Load Balancing router setup at my home.
Here is a link to the entire setup along with the routers connected etc.
As can be seen from the diagram, I have three different ISPs (for extra redundancy) – all PPPoE connections.

Problem

The main reason I purchased the Load Balancer was that we wanted an uninterrupted connection. However, the connection often goes off for a few seconds to a couple of minutes, kicking users out of zoom call, logging them out of remote servers, etc.
Currently, we've set it up in Load Balancing mode. Link backup hasn't been enabled because that causes only one ISP to be active at a given time. However, we also require the bandwidth, as at any point in time, there are approximately 20 devices connected.

Possible Issues

The possible issues that I can think of are

  1. ISP failure – Very less probable as all thee ISPs will need to fail
    at the same time
  2. Load Balancer Configuration – Some setting in the Load Balancer Configuration I'm missing?
  3. Too many routers/Devices connected – The official documentation of the Load Balancer (Link here) says the device can support up
    to 10,000 connections, while we have only around 20-25 devices
    connected
  4. An issue with the router configuration? – All the routers are set up with the following configurations
    • Mode – access point mode
    • SSID – all have the same SSIDs and passwords
    • Channel Interference – Shouldn't be an issue, as the routers are on different floors, and different ones have been configured for different channels.

Another thing I'd like to know is is it impractical to expect a seamless connection from a load-balancer? (E.g. No dropped frames in a video call even when one of the ISPs goes off). If so, is there a better alternative?

I've called the official technical support group multiple times. They suggest some tweaks like updating firmware, changing the DNS, etc, but most of them haven't produced any good results.
Any help would be appreciated! Thanks in advance!

PS: I'd posted this Question on the Network Engineering Stack Exchange, but was redirected here!

Edit

Adding some of the suspicious entries of the system log

Module Level Content Comment
Firewall WARNING Detected Ping of Death attack. Dropped x packets. (This occurs quite often) This happens often, but is quite spaced out
PPPoE Client WARNING WAN2: PPPoE sending PADI timeout. Happens once in a while
Firewall NOTICE Detected ARP Conflict. Dropped x packets This occurs rarely, but when it occurs, there are multiple similar logs, in a small time interval, leading to loss of many packets
Firewall Warning Detected suspected ARP attack packets. Be careful Don't know what to make of this. Comes once in a while

Hope this can throw some additional light!

Best Answer

I'll only answer this part first:

is it impractical to expect a seamless connection from a load-balancer? (E.g. No dropped frames in a video call even when one of the ISPs goes off). If so, is there a better alternative?

With a basic "load balancer" this is very difficult to achieve as far as I know, simply because each of the 3 ISPs requires you to use a different IP address to access the Internet.1

So whenever the link through ISP 1 goes down, all TCP connections made using that ISP's IP address are as good as gone – there is no way to migrate them to a different endpoint at all, much less without losing any packets. The same also applies to most UDP-based applications (there are a few which support endpoint roaming, but most video streaming protocols do not).

It's quite possible (though I have no way to confirm) that it is actually an unexpected ISP switch that causes all existing connections to die. You should watch your router's system logs or check what is being reported as your "public" IP address from the affected computer.

One possible workaround is to set up a VPN that uses a roaming-capable protocol (WireGuard in particular works well), and always make the video calls through that VPN. This works because your video connection will be using the VPN server's IP address the whole time, regardless of which ISP you're using to reach that server.

(But even with a roaming-capable protocol, there can still be some packet loss – if the link to the current ISP dies (i.e. switchover is unexpected), the server will not be aware of what just happened on your end and will just continue sending packets to your old address until it gets informed of the new endpoint. The protocol needs to be designed to buffer and retransmit if necessary.)


1 (Large enterprise networks use BGP to share the same IP addresses across multiple ISP connections, but this can only be done with the ISPs' cooperation and has quite a few additional costs.)

Related Question