mytelecoms
Vendor
Hi folks,
There's a big star for anyone who can help with this one.
We have 12 IBM BladeCenter chassis. All are 8677 variants.
Each chassis is fully loaded with blades, modules and power supplies.
Each power domain (two PSUs) in each chassis is connected to a different distribution block in the cab. The distribution blocks in each cab are on different phases.
We have four cabs - 3 chassis per cab.
We regularly get alerts indicating that power to one of the power supplies in a domain was cut for one or two seconds. The message is as follows:
Power modules are nonredundant in domain x
This is happening on several chassis which are in different cabs, and there is no clear pattern to it.
I have pushed and pushed the people in our data suite, and they tell me that there is nothing wrong with the supply. I am less convinced, but was willing to look elsewhere. So, I upgraded the MM firmware on every chassis to 1.21i, which claims to correct certain power management alerts.
However, since doing the upgrades, I'm still seeing the alerts.
What's more, we've had two power supplies fail in two weeks - both in the same cab, and both less than 12 months old.
Now, even if the PSU failures were coincidence, am I right in thinking I would only get these messages if the supply did genuinely fail for that period of time? Or is there a known issue where BladeCenters report crazy details about their PSUs?
Any help appreciated, as always!!
Cheers
Chris