Hi all, bear with me, it's a bit of a long one.
We have a pair of MX-250's in HA as Primary/Backup. They sit in two buildings, each with their own subnet and VLAN. Both subnets and VLANs are piped into both MX's across our site wide fibre link so they can work their magic with HA. They have been on the LAN for over 2 weeks now and on Monday evening they were swapped into place of our old firewalls, essentially shutting ports on the internet switch and opening the ones for the MX's.
Since then, we've had a really weird random issue with two other routers onsite. They are a pair of third party managed Cisco routers, each with their own dedicated external circuit and an interface in each of the buildings VLAN's and they have been running as they are for over a year without issue. They are configured with HSRP for both VLAN's so our default gateways have a single route in each VLAN to drop traffic on them and they route it to our other offices over MPLS.
Since Monday night, randomly, we loose the ability to reach the Cisco routers so we lose access to other office resources. The only fix I've found is to shut and open the port connecting the Cisco router to the LAN. There's nothing logged on either the switch or the router before the failures begin, all they show is the port state changing as I do it. The third party have been all over their routers and can't find a fault and I'm in the same boat with the switches they connect to.
The only recent change was making the MX's our default route to the internet (not to the sites supported by the Cisco routers) on Monday night so I'm certain it's connected but I've been banging my head on this one since I made the MX's live on Monday evening and I'm just not getting anywhere, hopefully someone has seen it before and can nudge me in the right direction.