Hello,
I'm facing a problem that seems to be related to memory module failure on an HP ProLiant DL380 G3 server. The problem causes the server to reboot on its own, usually in the evening. I tested the server with the SmartStart CD and found that it can't pass the total memory test. There are 2 tests: the Noise test and the Cache test. The status for both tests is Failed.
This is a quotation from the DiagTicket I got after testing:
"This test failed as a result of the ECC (error correcting code) reporting an error while the test was operating. While not a problem with the test itself, this indicates that there was an ECC error incident while the test was running. Check the IML for any ECC Threshold Passed events. The appropriate DIMM will be noted in the error message itself."
The test results point to correctable errors in the memory modules. But I replaced the old DIMMs with new ones and nothing changed. I did this twice, with 2 different pairs of new DIMMs, and the result was the same. Finally I replaced the server's system board with a new one and ran the test from the SmartStart CD again. This time the result was quite different: the server passed both the Noise test and the Cache test. But after a few hours the server rebooted on its own again. I ran the SmartStart CD test once more, and it again failed the Noise and Cache memory tests. The situation has been the same ever since. It seems to be a hardware problem, because it doesn't depend on the software installed.
What could be the root cause of the problem? (I have 2 DL380 servers with the same problem.)