All models?
Both routed and concentrator mode?
I've noticed it mostly on lower-end models like MX68s and MX84s, but that may simply be because there are more of these deployed.
Only had to investigate routed mode configurations so far.
Hi @PhilipDAth, yep, this was the same issue that a friend of mine ran into a couple of weeks back and was getting the runaround on.
They ran 15.44, 15.42.3 and 15.42, all with the same issue: their AutoVPN tunnels dropped every few hours.
@PhilipDAth: Good information, sir.
I have an MX100 HA pair that is not showing anything VRRP-related in the logs. Just registry up/down events.
I have two mx400 HA pairs going to 15.44 this week and next......🤔
>I have two mx400 HA pairs going to 15.44 this week and next
Let me know how the MX400s go ... I have another customer with a lot of sites hanging off theirs, and I'm very nervous about letting their upgrade proceed.
Their MX400s are used exclusively for AutoVPN. The others I have been investigating so far all use routed mode and do have Internet traffic flowing through them.
@PhilipDAth - I had an MX400 HA pair upgrade last night. I'll just explain the deployment:
Old firmware was a 14.x flavor
This is a DC head-end. AutoVPN comes in from the north (Internet), and all Internet and non-Meraki traffic heads south into the DC core. All Meraki-destined traffic heads back north to other hubs/spokes.
Warm spare, however it is an east/west direct connection (yeah, yeah, I know).
It's been almost 12 hours and the only VRRP transition was during the upgrade. There were some Ethernet port carrier logs about 10 minutes later, but only a few were logged on the primary and they have since subsided.
Here are the logs for the location. I filtered on VRRP and port events, given all the other boring registry stuff that gets logged.
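For anyone who wants to do the same filtering on an exported event log, here is a minimal sketch in Python. It assumes the log has been saved as plain text with one event per line; the sample lines and the `filter_events` helper are made up for illustration and are not the real Meraki event log format.

```python
import re
from typing import Iterable, List

# Whole-word match, so "port" does not also catch "support" or "report".
PATTERN = re.compile(r"\b(vrrp|port)\b", re.IGNORECASE)


def filter_events(lines: Iterable[str]) -> List[str]:
    """Keep only the lines that mention VRRP or port events."""
    return [line.rstrip("\n") for line in lines if PATTERN.search(line)]


if __name__ == "__main__":
    # Hypothetical sample lines, not copied from a real device.
    sample = [
        "primary: VRRP transition",
        "primary: registry connectivity up",
        "primary: ethernet port carrier change",
        "spare: registry connectivity down",
    ]
    for event in filter_events(sample):
        print(event)
```

To run it against a saved file, pass the open file handle instead of the sample list, e.g. `filter_events(open("events.txt"))`, where `events.txt` is whatever name you gave the export.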
I have a pair of MX450s running 15.44 and I do not see this problem.
>I have a mx100 HA pair that is not showing anything in logs with vrrp.
@Aaron_Wilson, are they being used for Internet access as well, or only AutoVPN?
AutoVPN comes in over the Internet (WAN). But any traffic destined for the Internet heads south; it does not hairpin.
Please post a follow-up if/when you proceed with 15.44 on the MX400s. I've been holding off on my MX400 HA pair upgrade, due mostly to procrastination. But seeing this discussion, I am glad I procrastinated.
2nd set of MX400 warm spares moved to 15.44. Problem free!
Our company has had an issue with 15.44 where two separate MX450s just stopped responding and had to be power cycled. It happened once, then again 30 days later.
Looks like we got hit with this today. Our HA pair of MX450s that act as our SD-WAN concentrators started having problems. The primary went completely unresponsive and had to be power cycled to come back online.
The spare took over, and for some reason reported that it had a bad power supply, which it didn't; the lights were fine.
Looks like I am going back to 15.43.
Is anyone seeing this issue with the MX250?
Do we have any update on this? I have avoided 15.44 on my MX100 HA pair because of this post.
We went back to MX 14.56 and have not had any issues since.
I had 6 sets of MX 84/100/400 run the firmware just fine.
We continued to see the MX HA pairs reboot when on MX 15.44. A support case was opened, and Meraki support knew of some memory settings that they could tweak to keep the crashes/reboots from occurring. We have not seen the MX HA pairs crashing/rebooting since support tweaked this memory setting. Wish I could tell you exactly what the setting was called, but support wouldn't tell me.
Are you able to give me the case number for this? I am interested in looking into it further.
I sent you a message with the info.
Could you provide it to me as well? I have an open ticket right now regarding this issue.
The fix is in 16.15 as well. We are now running that. So far no issues, but it is too soon to be certain.
This is the info support gave me in my case.
"You could be hitting a known issue on our MX15 platform which can be addressed with a backend change. Please call us during a maintenance window so we can apply this backend setting. The problem you're seeing is that the device is rebooting and thus triggering a failover, what's causing the behaviour is flow tables being very large on the MX."