Software RAID5 problems [Closed]

ctav01 · Last edited by ctav01 on Thu Nov 08, 2007 7:10 pm; edited 1 time in total

I'm not sure if one of my drives failed or the array just stopped.

Cyker · Veteran Joined: 15 Jun 2006 Posts: 1746

Can you do

ctav01 · Posted: Wed Oct 17, 2007 8:26 pm Post subject:

Cyker · Veteran Joined: 15 Jun 2006 Posts: 1746

Erk... that doesn't look good...

It seems one of the drives has stopped being detected and the other has, for some reason, been relegated to 'spare'.
You can't (normally) have a 2-device RAID5 array which is why the array has stopped itself (It, too, is also probably thinking WTF?!

)

Check the drives, esp. that the connectors are all in firmly (One thing I hate about SATA vs IDE - SATA plug/socket design and build quality is, by and large utter utter crap, and very vulnerable to 'chip creep'. Should be plug creep i guess

)

We need three drives at least to have a degraded RAID5, so hopefully the spare or the missing can be coaxed into being re-integrated into the array, then it won't mater if the 4th one can be re-integrated or just rebuilt...

ctav01 · Posted: Mon Oct 22, 2007 4:51 am Post subject:

Sorry, it took me a while to get the machine on a bench. So I double-checked the SATA connections and:

ctav01 · Posted: Tue Oct 23, 2007 11:17 pm Post subject:

heschne · n00b Joined: 25 Oct 2007 Posts: 1 Location: Bavaria

Still interested in this?

I got the following
(after adding another controller, readjusting the cables changing sd? sequence, reboot, change my mind and rearranging again)

ctav01 · Posted: Fri Oct 26, 2007 6:05 am Post subject:

Definitely still interested.

ctav01 · Posted: Mon Oct 29, 2007 1:31 pm Post subject:

ctav01 · Posted: Wed Oct 31, 2007 12:19 am Post subject:

Still need help with this. I tried assembling the array and this is what I got. Not sure what the slot 3 thing means.

ctav01 · Posted: Thu Nov 01, 2007 1:58 am Post subject:

Well, I was able to get it assembled but it looks like my data is all gone. And for a while there, it was showing a failed drive but now it all seems fine so I'm not sure what to do now. Any comments please?

Cyker · Veteran Joined: 15 Jun 2006 Posts: 1746

Got your PM; Sorry, I haven't added anything because TBH I don't have much to add!!

Its very odd that the array would have just gone; I mean, you can check data and power cable connectors are secure at both ends, that they have enough power driving them and all of that, but to get a double-HD failure is quite worrying!

The data loss is likely due to you essentially reconstructing the array, but if two of the disks had really failed then that data would have been gone anyway...

All I can think of is to run the manufacturer's HD utils on all the drive (I have an old version of Maxtor's PowerMAX which I boot with GRUB off a syslinux floopy emulator into FreeDOS) to make sure there is no impending problems, and also memtest the heck out of your RAM just in case.

Bad RAM is one of the deadliest things for data corruption, which is why server people are willing to pay the superlatively extortionate price for ECC memory.

This is all standard checking stuff 'thi - I really have no clue as to what might have caused your problem in the first place

(And now I'm terrified of the possibility of it happening to me; I haven't got enough money to back up my own array, which is of a similar configuration!! :shock:

)

ctav01 · Posted: Sat Nov 10, 2007 6:22 am Post subject:

Thanks for the reply.

Actually, I got very lucky. I have no idea why the array went down but the forced assemble seemed to restore it and I can't find any missing data (I'm backing everything up as quick as I can though).

I'm not exactly sure how SMART works but I've got it looking at all the drives and one is showing 994 errors and a second one 17 errors so I think I've found the one to replace.

Thanks again for your help.

Cyker · Veteran Joined: 15 Jun 2006 Posts: 1746