I believe I have run into this issue with a client. They have a pair of MS425s as their network core, connecting to a pair of Nexus switches for their server access block via a port-channel. The issue seems to repeat every 1 to 3 weeks. All cabing between the switches has been replaced. Intermittently, the Nexus switch starts seeing the ports connecting to the MS425s start flapping in 30s cycles, and eventually, the Nexus switch errdisable's them - permanently. Errdisable recovery will not enable the ports. Disable/enabling the ports does not fix them. Unplugging the 10Gbe TwinAx cable from one of the LACP members, and then plugging it back in again fixes it. I note in the latest MS release notes for 17.2.1 it lists this resolved bug: "All new LAG configurations will block redundant links if the connected device is not configured for LACP. This change fixes an issue where switches would sometimes move LAG ports to an active forwarding state prior to LACP convergence, creating the potential for loops. The change does not apply to existing LAG configurations.". I also take it to mean that after you apply this firmware, you need to delete and then re-create the LACP group.
... View more