Storage Bridge Unreachable Intermittently Due to Management Network Flapping
Applies to
- ONTAP 9
- MetroCluster FC
- ATTO FibreBridge 7500N / 7600N
Issue
- Intermittent “
StorageBridgeUnreachable_Alert” reported by ONTAP Health Monitor (schm) on FAS systems. - Alerts are raised and cleared within a short period. Logs indicate e0M port flapping on the affected nodes, with the bridge becoming unreachable during these events.
- Sample log output:
Wed Nov 26 14:18:41 +0100 [Node-01: mgmt_port_link_status_poll: netif.linkDown:info]: Ethernet Wrench Port: Link down, check cable.Wed Nov 26 14:18:41 +0100 [Node-01: vifmgr: vifmgr.portdown:notice]: A link down event was received on node Node-01, port e0M.Wed Nov 26 14:18:41 +0100 [Node-01: vifmgr: vifmgr.lifmoved.linkdown:notice]: LIF Node-01_mgmt1 (on virtual server 000000000), IP address 10.XXX.XXX.XX, is being moved to node Node-01, port e0M.Wed Nov 26 14:20:13 +0100 [Node-01: mgwd: mcc.bridge.error:debug]:Wed Nov 26 14:21:47 +0100 [Node-01: schmd: hm.alert.raised:alert]: Alert Id = StorageBridgeUnreachable_Alert , Alerting Resource = 20000a0a0a0a0a0a raised by monitor system-connectWed Nov 26 14:21:47 +0100 [Node-01: schmd: hm.alert.raised:alert]: Alert Id = StorageBridgeUnreachable_Alert , Alerting Resource = 20000b0b0b0b0b0b raised by monitor system-connect
- The link has automatically come up after 1 minute:
Wed Nov 26 14:21:28 +0100 [Node-01: mgmt_port_link_status_poll: netif.linkUp:info]: Ethernet Wrench Port: Link up.Wed Nov 26 14:21:28 +0100 [Node-01: vifmgr: vifmgr.portup:notice]: A link up event was received on node Node-01, port e0M.Wed Nov 26 14:21:31 +0100 [Node-01: vifmgr: vifmgr.lifsuccessfullymoved:notice]: LIF Node-01_mgmt1 (on virtual server 000000000), IP address 10.XXX.XXX.XX, is now hosted on node Node-01, port e0M.Wed Nov 26 14:30:13 +0100 [Node-01: mgwd: mcc.bridge.error.cleared:debug]:Wed Nov 26 14:31:47 +0100 [Node-01: schmd: hm.alert.cleared:notice]: Alert Id = StorageBridgeUnreachable_Alert , Alerting Resource = 20000a0a0a0a0a0a cleared by monitor system-connectWed Nov 26 14:31:47 +0100 [Node-01: schmd: hm.alert.cleared:notice]: Alert Id = StorageBridgeUnreachable_Alert , Alerting Resource = 20000b0b0b0b0b0b cleared by monitor system-connect
- The same pattern was observed on multiple occasions. No other system health alerts were present.
