Over the weekend we lost an array twice. Here are the specs of the system:
Dell PE 1750 w/onboard PERC and RAID 1 136GB drives
Powervault 220 Array enclosure
7 x 136GB HDD drives
Adaptec 2120S RAID controller
This machine is an Exchange server
The enclosure showed that we lost 4 drives within the same 30-second period, and we lost the array. IDs 9, 12, 13, and 14 were reported as lost. We shut down the system and reseated all drives. It came up with drive 12 failed and the array degraded. According to the log, this was the first drive to fail when the drives went down. We reseated drive 12 again, and the array rebuilt and was up and running like normal.

Two hours after the rebuild we lost drive 5, and when we checked the logs only drive 5 showed as failed. This drive has since been replaced and the array is currently rebuilding. Since this was multiple hardware failures, we grabbed another temporary enclosure, attached it to the onboard RAID controller, configured an array, and moved the Exchange DBs off the problematic array onto the temp array.

The problem now is that I do not know where to go to resolve the issue. I am not convinced it is a simple drive failure, since different drives have been reported each time. Since none of the DBs are on the array in question anymore, it seems fine, but it is also just sitting there idle. Does anyone know of any test software to do read/write tests on this array? Has anyone seen an issue like this before?
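
To make clear what I mean by a read/write test, here is a rough Python sketch, not a proper diagnostic tool. It writes a repeatable pattern to a scratch file on the suspect array, then reads it back and reports any blocks that do not match. The function names, the 1 MiB block size, and the seed are all my own placeholders, and the read-back may be served from the OS cache rather than the disks, so the file would need to be much larger than RAM to really exercise the drives.

```python
import hashlib
import os

BLOCK_SIZE = 1024 * 1024  # 1 MiB per block (arbitrary choice for this sketch)


def make_block(seed: bytes, index: int, size: int = BLOCK_SIZE) -> bytes:
    """Build a deterministic block so it can be regenerated for comparison."""
    digest = hashlib.sha256(seed + index.to_bytes(8, "big")).digest()
    # Repeat the 32-byte digest until it fills the block, then trim.
    return (digest * (size // len(digest) + 1))[:size]


def write_pass(path: str, blocks: int, seed: bytes = b"raid-test",
               size: int = BLOCK_SIZE) -> None:
    """Write `blocks` pattern blocks to `path` and ask the OS to flush them."""
    with open(path, "wb") as f:
        for i in range(blocks):
            f.write(make_block(seed, i, size))
        f.flush()
        os.fsync(f.fileno())


def verify_pass(path: str, blocks: int, seed: bytes = b"raid-test",
                size: int = BLOCK_SIZE) -> list:
    """Read the file back and return the indexes of blocks that do not match."""
    bad = []
    with open(path, "rb") as f:
        for i in range(blocks):
            if f.read(size) != make_block(seed, i, size):
                bad.append(i)
    return bad
```

Calling `write_pass` and then `verify_pass` in a loop with the same path and block count, where the path is a scratch file on the array in question, would keep the array busy, and a non-empty list from `verify_pass` would point at data that did not survive the round trip.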