When one PSU is failed on R730/R740/R640 model, Power Supply Failure" alarm does not fire in 11.3.2 Health & Wellness while R620 model is no issue. Here are some examples between problematic host and working host.
R640 model: Problematic host
# ipmitool sdr type "Power Supply"
PS Redundancy | 77h | ok | 7.1 | Redundancy Lost
Status | 85h | ok | 10.1 | Presence detected
Status | 86h | ok | 10.2 | Presence detected, Power Supply AC lost
System Stats Browser detected Power Supply Status as "ok" as shown below and this is unexpected value. Expected value is "Presence detected".
PSU monitoring script(/usr/lib/collectd/python/nwsysinfo.py) in 11.3.2 does not have code checking mechanism that is implemented with respect to Series 5 or above versions.
NW version 18.104.22.168 above, this issue was fixed.
You need to follow the steps below on the NW server and for one of the core appliances for which this issue is observed in 11.3.2.x.
Take a backup of /usr/lib/collectd/python/nwsysinfo.py
Replace the file with the attached nwsysinfo.py
Delete the existing nwsysinfo.pyc file.
Please make an observation if the stats are observed after this change.
If you are unsure of any of the steps above or experience any issues, contact RSA Support and quote this article number for further assistance.