C9300L switch stack management container failing

GetoffmyvLAN
Comes here often


I've recently been struggling with a problem on a C9300L switch (less than 6 months old) in a two-switch stack: its management container stops working, but the member switch doesn't take over the management plane.

I've taken the following troubleshooting steps, with an active Meraki support ticket open throughout.

  1. Forced the member switch (SW2) to become the active switch by rebooting the active switch (SW1). This gave me two weeks of stable operation.
  2. Updated the switch firmware to CS 17.2.1.1, which elected SW1 as the active switch again. We got 20 days of stable operation before its management container stopped working again.

 

The stack is still passing data plane traffic normally, but I'm perplexed as to why our member switch isn't able to take over management of the stack. This is only a two-switch stack, but what happens if I hit this issue on a larger one? I have stacks ranging from 2 to 6 switches. If the members of these stacks can't just automagically take over management of the stack, then I lose both visibility for monitoring and the ability to make configuration changes.

4 Replies
PhilipDAth
Kind of a big deal

Have you tried running the IOS-XE software with native Meraki support and no container?

GetoffmyvLAN
Comes here often

While I'd like to do that, we have 16 switches in production, and it would require migrating them to IOS-XE. We're not quite comfortable making that jump yet.

On another note, the switch came back online after being offline for 28 hours. We're really not sure what is going on, but I was able to get the support data bundle this time, and I've re-opened my ticket with Meraki support so they can look at the logs and help us figure out what's happening.

Mloraditch
Kind of a big deal

The best and most likely long-term fix is what @PhilipDAth said. The old method with the management container is basically deprecated at this point. I imagine there may be a few more fixes for urgent needs, but given the roadmap, I would not expect Meraki to put much effort into issues like this.

cmr
Kind of a big deal

I can confirm that the IOS-XE release is pretty stable on a C9300L, but disabling and enabling ports in LACP aggregations is a bit hit and miss.
