MetroCluster IP iWarp interconnect to BES-53248 switch port offline during 9.9.1 ANDU
Applies to
- MetroCluster IP
- AFF-A300 / FAS8200
- ONTAP 9.9.1
- Automated Non Disruptive Update (ANDU)
- BES-53248 firmware 3.4.4.6 and 3.7.0.4
Issue
- During ANDU the first node reboot to 9.9.1 the node is stuck in "waiting for reservations to clear" as seen in SP console.
Waiting for reservations to clear
Oct 09 16:18:17 [cluster-02:cf.disk.ResvFail:ALERT]: Disk 0a.30.13P3 has been reserved by the High Availability (HA) partner as part of a takeover operation.
Oct 09 16:18:17 [cluster-02:cf.disk.ResvTakeOver:notice]: This node will wait for giveback and the disk reservations to be released.
- Visual inspection of the BES-53248 switch shows the storage ports
e1a
ande1b
to switch ports0/5
and0/6
are dark, no link light, on one or both switches.
- Validate Interconnect state is down using node shell on up node.
cluster-02*> ic status
Warning: This command only operates on the HA adapter.
Link 0: down
Link 1: down
IC RDMA connection : down
cluster-02*> cf status
cluster-01 is up, takeover disabled because of reason (HA interconnect error. Verify that the partner node is running and that the HA interconnect cabling is correct, if applicable.
cluster-02 has disabled takeover by cluster-01 (HA interconnect error. Verify that the partner node is running and that the HA interconnect cabling is correct, if applicable.
- Validate storage failover states interconnect error
cluster::*> storage failover show
(storage failover show)
Takeover
Node Partner Possible State Description
-------------- -------------- -------- -------------------------------------
cluster-01
cluster-02 false cluster-02, Takeover is not possible:
Storage failover interconnect error,
NVRAM log not synchronized, Disk
inventory not exchanged
cluster-02
cluster-01 false cluster-01 Takeover is not possible:
Storage failover interconnect error,
NVRAM log not synchronized,
Disk inventory not exchanged
2 entries were displayed.