Daily and Weekly VPN Drops But Probably Caused by Remote Device

MarcW
Here to help

Daily and Weekly VPN Drops But Probably Caused by Remote Device

I have 10 MX's that all talk to the same remote device.  Many mornings, I will lose some of my site to site VPN connections to the non-Meraki peer. 

 

What can I change/ extend such that the MX's will keep trying longer to maintain the connection even if it is the remote device dropping it?  Usually, a power cycle on either end reestablishes the connection. 

 

 

11 Replies 11
Tony-Sydney-AU
Meraki Employee All-Star Meraki Employee All-Star
Meraki Employee All-Star

Hi @MarcW ,

 

Sounds like your non-Meraki peer is turning the tunnels off due to no activity.

 

Meraki devices send special DPD vpn packets to keep the tunnels alive when there's no traffic. However, not all VPN peers reply to DPD.

 

Therefore, the best way to keep site-to-site tunnels UP is generating interesting traffic - e.g.: a continuous ping from a host inside a local subnet into a destination at the other side.

If you found this post helpful, please give it kudos. If my answer solved your problem, click "accept as solution" so that others can benefit from it.
RaphaelL
Kind of a big deal
Kind of a big deal

Hi ,

 

This is the 3rd time I'm seeing this. MarcW , Myself and https://community.meraki.com/t5/Security-SD-WAN/Traffic-not-getting-initiated-from-IKEv1-and-IKEv2-f...

 

Are you running MX 18.2 by any chances ?

MarcW
Here to help

4 of the 5 are due for the update.  Coordinating that change with my local admin as we speak. 

 

What we saw were UPLINK notices almost daily for a mix of MXs.  When I started looking for events, there were no fails.  When I called the local admin, he said he'd been using the reboot as a workaround to address the issue.  I was relieved that the MXs weren't "failing" several times a week - lol - but I had to then change my approach after identifying the real issue. 

 

After I update the firmware, we'll see if there are any latency/ packet drop accommodating settings to implement. 

cmr
Kind of a big deal
Kind of a big deal

How about putting an MX at the other end? 😉

If my answer solves your problem please click Accept as Solution so others can benefit from it.
MarcW
Here to help

🙂 if only they were our direct client. 

MarcW
Here to help

Firmware updated - no joy

Continuous ping - no joy

 

At this point, with the same behavior across 8+ site to site connections to a single destination, dropping the same way (and possibly same happening to other businesses) and needing reboots every other day or so, I am going to conclude it is them, not us.  Thanks!!

RaphaelL
Kind of a big deal
Kind of a big deal

What version were you running prior to the upgrade ?

MarcW
Here to help

18.211

RaphaelL
Kind of a big deal
Kind of a big deal

Great. So every one on MX 18.107 or 18.211 seems to be having the same issues. Somehow support doesn't know that it is affecting multiple clients. Curious.

NetworkNetwork
New here

Non Meraki Peer To another Cisco

Used to be Intermittent drops a couple times a year

Now dropping daily

Only solved by rebooting the MX

All Meraki to Meraki tunnels are always happy, just the non Meraki peer drops

So far support not much help - recommended firmware, no change, currently on 18.107.12

Anyone find a solution? Or at least an automated way to bounce just the non Meraki VPN?

Tony-Sydney-AU
Meraki Employee All-Star Meraki Employee All-Star
Meraki Employee All-Star

Hi @NetworkNetwork , thanks for bringing this matter here.

 

If your tunnel is going down every day, it's usually a racing condition in either Phase-1 or Phase-2 negotiation and SA database; or DPD timers like I mentioned in my first reply.

 

In some edge cases, it might be related to packet loss in transit between peers.

 

I would recommend you check the time of day when it goes down and try to find a pattern.

 

In addition, run MTR with 20 cycles to that Cisco peer right after the the tunnel is down - before even trying to reestablish tunnel.

 

Doing MTR would tell you if there is some level of packet loss.

 

After identifying which hop has packet loss, check in a WHOIS tool to find if that hop is within your provider network or some other provider Autonomous System (AS).

 

If you found this post helpful, please give it kudos. If my answer solved your problem, click "accept as solution" so that others can benefit from it.
Get notified when there are additional replies to this discussion.