AFF A400, FAS8300, FAS8700 shuts down with MULTIPLE CHASSIS FAN FAILED: System will shut down in 2 minutes
Applies to
- ONTAP 9
- AFF systems
- ASA systems
- FAS systems
- FAN module
Issue
- Node shuts down with one or more of the following AutoSupport alerts:
HA Group Notification (MULTIPLE CHASSIS FAN FAILED: System will shut down in 2 minutes) ERROR
HA Group Notification (Health Monitor process cphm: CriticalFruMultiFaultAlert[xxxxxxxxxxxx]) ALERT
- The following errors are seen in the event logs:
[Node-02: env_mgr: monitor.chassisFan.stop:error]: Chassis fan contains at least one stopped fan: Fan4_1 (failed)
[Node-02: env_mgr: monitor.chassisFan.stop:error]: Chassis fan contains at least one stopped fan: Fan4_2 (failed)
[Node-02: env_mgr: monitor.chassisFanFail.xMinShutdown:EMERGENCY]: Multiple Chassis Fan failure: System will shut down in 2 minutes.
[Node-02: monitor: monitor.globalStatus.critical:EMERGENCY]: Multiple fans has failed: SysFan4 F2, SysFan4 F1.
[Node-02: env_mgr: callhome.c.fan.fru.fault:error]: Call home for CHASSIS FAN FRU FAILED: Fan4_1
[Node-02: env_mgr: callhome.c.fan.fru.fault:error]: Call home for CHASSIS FAN FRU FAILED: Fan4_2
- No readings are seen for the sensors of the affected fan module in
SP-LATEST-IPMIsection of AutoSupport:
Fan1_1 | 70h | ok | 29.1 | 12600 RPM
Fan2_1 | 71h | ok | 29.2 | 12600 RPM
Fan3_1 | 72h | ok | 29.3 | 12600 RPM
Fan4_1 | 73h | ns | 29.4 | No Reading
Fan1_2 | 74h | ok | 29.1 | 12600 RPM
Fan2_2 | 75h | ok | 29.2 | 12600 RPM
Fan3_2 | 76h | ok | 29.3 | 12600 RPM
Fan4_2 | 77h | ns | 29.4 | No Reading
Troubleshooting attempted
- Attempted to reseat the suspicious fan module
- Swapped the suspicious fan module with a known working one. For example: Swap fan module A1 and fan module A3
- Rebooted and alert continues
- Issue follows the fan
