Hi,
I've been running failover tests on a physical stack of two switches (C9300-48U and C9300-24U). During these tests I observed some behaviors, and I would like to confirm whether they are expected by design in these scenarios. For the stack configuration, I followed the official Meraki documentation for physical stacking of Catalyst switches (https://documentation.meraki.com/MS/Stacking/Switch_Stacks#Stacking_MS390s).
The points I want to validate are the following:
Rejoining a member to the stack:
Is it expected that, when a member switch rejoins the stack, all communication through the uplinks is lost for approximately 1 to 1.5 minutes?
Meraki Dashboard Display:
When a member switch shuts down or is no longer present in the stack, is it normal for both the active switch and the missing switch to appear unreachable in the Meraki dashboard after a certain amount of time, even though the active switch remains operational?
Connectivity to the Stack Management IP:
Is it expected that, when one of the switches is not present in the stack, the stack's management IP address does not respond to ping, even though the other switch remains up and operational and a host connected to the stack can be reached from another switch on the network? Additionally, is it normal that a host connected to the stack also cannot reach the stack's management IP address, even though one switch is still up?
Below are the steps I am following for the failover tests. I only power down the switches; I do not disconnect any stack cables:
1. Simulate a failure on switch 1.
2. Confirm that switch 2 assumes the "Active" role.
3. Reconnect switch 1, verify that its role is "Member" and that switch 2 still has the "Active" role.
4. Simulate a failure on switch 2 and verify that switch 1 assumes the "Active" role again.
5. Reconnect switch 2, verify that its role is "Member" and that switch 1 still has the "Active" role.
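To put a number on the outage seen when a switch rejoins (steps 3 and 5), the ping results collected during each step could be summarized with something like the sketch below. It is only an illustration: the function name `outage_windows` and the (timestamp, reachable) sample format are assumptions made for this example, not anything provided by Meraki or the dashboard.

```python
from typing import Iterable, List, Tuple


def outage_windows(samples: Iterable[Tuple[float, bool]]) -> List[Tuple[float, float]]:
    """Return (start, end) pairs for each run of failed probes.

    samples: (timestamp_in_seconds, reachable) pairs in chronological order,
             e.g. one entry per ping sent to an uplink or management IP.
    start:   timestamp of the first failed probe in a run.
    end:     timestamp of the next successful probe, or of the last
             recorded probe if the log ends while still unreachable.
    """
    windows: List[Tuple[float, float]] = []
    start = None  # timestamp where the current outage began, if any
    last = None   # timestamp of the most recent sample seen
    for ts, ok in samples:
        if not ok and start is None:
            start = ts                    # outage begins
        elif ok and start is not None:
            windows.append((start, ts))   # outage ends on first success
            start = None
        last = ts
    if start is not None:
        windows.append((start, last))     # log ended during an outage
    return windows


if __name__ == "__main__":
    # Hypothetical probe log: one ping per second, made-up values.
    log = [(0, True), (1, False), (2, False), (3, True), (4, False)]
    for begin, end in outage_windows(log):
        print(f"unreachable from t={begin}s to t={end}s ({end - begin}s)")
```

Running this against the timestamps from a continuous ping during each step would show whether the loss is consistently in the 1 to 1.5 minute range or varies between the two switches.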
It's also worth mentioning that I have a port channel made up of one fiber-optic port on each switch, which was implemented to keep services available if one switch goes offline.
I appreciate your support in confirming whether these behaviors are part of normal stack operation or indicate an anomalous condition.