Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations Mike Lewis on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

Linus Software RAID failure - NEED HELP!

Status
Not open for further replies.

tkconn

IS-IT--Management
Jan 7, 2006
1
0
0
US
I have a fairly confusing situation and am in need of some help and direction.

We have a NAS appliance running linux and software RAID (IDE). The unit has 4 250GB drives in it.
The 4 drives are numbered from left to right 1 - 4. A few weeks ago, #2 lit up amber and was reporting as a failed drive. Upon further discovery we learned that the 4 drives we purchased for this unit are "fake" refurbed Maxtor drives. Maxtor told us to get our data off those drives ASAP. So we ordered 4 new drives from a more reliable source.

We took out # 2, put a new drive in, the array re'syncd and a few hours later, #4 lit up with an amber light and we pulled that one out and put a new drive in and told it to re-sync.

Here is where the fun begins...
While trying to sync up #4, #2 lit amber again and the sync could not complete. The system would hang and the unit would need to be reset.
The now 3 drive array was running degraded and every time we tried to get data off, the system would hang becuase of #2.
So, we figured something's wrong here because #2 is brand new! Well...#2 was a tad differenet from the other 3 in which it had a warranty sticker on the front the other 3 didn't have...and Maxtor then tells us "oh, that's a refurbed drive...legitimate, and under warranty, but refurbed".

We took the drive out, put it in a regular system, tested the drive and SMART says it's going to fail and then a diag tells us the drive has tons of bad sectors.
We try to run SpinRite on the drive and 75% into repairing the drive, it finally fails!

So here's what we have,
all 4 original drives in tact with data (but #2 and #4 are no longer recognized because of the re-sync) and 4 new drives that we can't do anything with until we figure out what to do in regards to getting the data.

We figure if we SpinRite the original 4 and then somehow edit the partitions on the original #2 and #4, we can get the array back up and try this whole thing again.

Is that possible since it's software RAID? There are 5 partitions on each drive and with all the utilities we have, we can see them and clone the drives...but with the sync all messed up, we are stuck.

And of course, our backups are not as up to date as they should have been and there is data we MUST retrieve.

Any help, suggestions, directions would be most appreciated!

 
What RAID level does your NAS have? If its a x4 RAID5 with disks 2 & 4 down, the whole array should be stuffed by now! Does your NSA have mdadm installed? If so, what does mdadm --detail [MD_device] show?


--== Anything can go wrong. It's just a matter of how far wrong it will go till people think its right. ==--
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top