Gentoo Forums
Software raid5 mdadm
PietdeBoer
Apprentice

Joined: 20 Oct 2005
Posts: 244
Location: Eindhoven, the Netherlands

PostPosted: Tue Sep 25, 2007 2:06 pm    Post subject: Software raid5 mdadm

Hey guys,

I've set up a software RAID 5 array on four 300 GB SATA disks using mdadm.

My mdadm.conf:

Code:
ARRAY /dev/md0 level=raid5 num-devices=4 UUID=bee0557b:82595891:05cb0d31:ecdeebd9
   spares=1

df -h:

Code:
/dev/md/0             826G  461G  323G  59% /DATA/ARRAY1



My question:

Can I replace a disk while the server is running, and will the RAID array still be functional on 3 disks?

I'm planning to replace one disk because it's giving errors and might be dying.


thx in advance!
_________________
_ Got Root? _
HeissFuss
Guru

Joined: 11 Jan 2005
Posts: 414

PostPosted: Tue Sep 25, 2007 2:17 pm    Post subject:

Yes. Use mdadm /dev/md0 --fail /dev/sdX1 to mark the bad disk as failed, then use --remove to take it out of the array.
/dev/md0 will then keep running in degraded mode on the remaining 3 disks. You can still read/write to it. Just use --add to add your new disk after you have installed it.
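
Something like this, assuming for the sake of example that the failing member is /dev/sdh1 (substitute whatever device it actually is):

Code:
mdadm /dev/md0 --fail /dev/sdh1      # mark the suspect member as failed
mdadm /dev/md0 --remove /dev/sdh1    # take it out of the array
# ...physically swap the disk, partition it like the others, then:
mdadm /dev/md0 --add /dev/sdh1       # add the replacement; the rebuild starts here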
PietdeBoer
Apprentice

Joined: 20 Oct 2005
Posts: 244
Location: Eindhoven, the Netherlands

PostPosted: Tue Sep 25, 2007 2:21 pm    Post subject:

OK, so the --fail option will remove the bad drive and rebuild the array (live?) on the 3 remaining disks, so the array will stay functional while I replace the 4th disk?


thx for your fast answer!
_________________
_ Got Root? _
Mad Merlin
Veteran

Joined: 09 May 2005
Posts: 1155

PostPosted: Tue Sep 25, 2007 2:33 pm    Post subject:

Your mdadm.conf indicates that you've got 4 devices and 1 spare, but your df -h suggests that you're using 4 disks actively. Either you actually have 5 disks (4 active and 1 spare) or 4 disks (and 0 spares). But it doesn't matter either way: if you do have a spare, you can remove one of the currently active disks and the spare will be pulled into the set of active disks; if you don't have a spare, your RAID array will run in degraded mode until you add another disk.

As for hotplugging another disk, it depends on your hardware: some SATA controllers support hotplug, and some don't. Here's a status report on various SATA features for different hardware; it's for kernels a few releases back, though (but current features are likely a superset of what they were then).
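
To see which situation you're in, one way (using the array device from your config) is:

Code:
mdadm --detail /dev/md0    # lists each member and whether it's active or a spare
cat /proc/mdstat           # quick summary of the same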
_________________
Game! - Where the stick is mightier than the sword!
HeissFuss
Guru

Joined: 11 Jan 2005
Posts: 414

PostPosted: Tue Sep 25, 2007 3:59 pm    Post subject:

Can you post the output from cat /proc/mdstat?

If your RAID encountered errors, it may already have failed the bad drive. If not, you need to --fail it and then --remove it. With one drive failed, your RAID is in a degraded state, running on 3 drives with no parity protection. You can still read/write, and the partition will stay mounted/active. When you --add your new device (after you've physically installed it), the array will be rebuilt at that point. You can see the status of the rebuild with cat /proc/mdstat. The rebuild will slow performance on that partition, but it will remain active.
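
For example, to keep an eye on the rebuild (the 5-second interval is just a suggestion):

Code:
cat /proc/mdstat               # shows rebuild progress and estimated time
watch -n 5 cat /proc/mdstat    # refresh it every 5 seconds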
PietdeBoer
Apprentice

Joined: 20 Oct 2005
Posts: 244
Location: Eindhoven, the Netherlands

PostPosted: Wed Sep 26, 2007 4:24 pm    Post subject:

Code:
md0 : active raid5 sdf1[0] sdi1[3] sdh1[2] sdg1[1]
      879148800 blocks level 5, 64k chunk, algorithm 2 [4/4] [UUUU]



It doesn't look like the disk has been marked as failed.


This is what I get in my dmesg:

Code:
ata11: CPB 29: ctl_flags 0x1f, resp_flags 0x1
ata11: CPB 30: ctl_flags 0x1f, resp_flags 0x1
ata11: Resetting port
ata11.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x2 frozen
ata11.00: cmd 60/00:00:bf:0f:2d/01:00:13:00:00/40 tag 0 cdb 0x0 data 131072 in
         res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
ata11.00: cmd 60/a8:08:3f:35:4e/00:00:1f:00:00/40 tag 1 cdb 0x0 data 86016 in
         res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
ata11.00: cmd 60/80:10:3f:11:2d/00:00:13:00:00/40 tag 2 cdb 0x0 data 65536 in
         res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
ata11: soft resetting port
ata11: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata11.00: configured for UDMA/133
ata11: EH complete
SCSI device sdh: 586114704 512-byte hdwr sectors (300091 MB)
sdh: Write Protect is off
sdh: Mode Sense: 00 3a 00 00
SCSI device sdh: write cache: enabled, read cache: enabled, doesn't support DPO or FUA
factorial[32050]: segfault at 0000000000020e31 rip 000000000040361e rsp 00007fff7e757370 error 4
factorial[32077]: segfault at 0000000000020e31 rip 000000000040361e rsp 00007fff9f409040 error 4
factorial[32081]: segfault at 0000000000020e31 rip 000000000040361e rsp 00007fff7ee2aa60 error 4
factorial[32085]: segfault at 0000000000020e31 rip 000000000040361e rsp 00007ffffeaa36d0 error 4
factorial[32073]: segfault at 0000000000020e31 rip 000000000040361e rsp 00007fff0b674290 error 4

_________________
_ Got Root? _
HeissFuss
Guru

Joined: 11 Jan 2005
Posts: 414

PostPosted: Fri Oct 05, 2007 5:49 pm    Post subject:

Are you sure that the disk is bad? Segfaults are app issues. Usually disk errors show up as I/O read/seek errors.

Did you install a new kernel before these errors started occurring?
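
To check the disk itself, a SMART report would be one way (this assumes smartmontools is installed and that your controller passes SMART commands through; it's just a common check, nothing from your setup):

Code:
smartctl -a /dev/sdh        # full SMART report: look at reallocated/pending sector counts
smartctl -t short /dev/sdh  # start a short self-test; results show up in -a output later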
Sub Zero
n00b

Joined: 20 Jul 2006
Posts: 52
Location: Belgium :: Geraardsbergen

PostPosted: Fri Oct 05, 2007 9:49 pm    Post subject:

HeissFuss wrote:
Are you sure that the disk is bad?

Or a bad driver :|
And indeed, I'd look at the software side for this. If it really were the disk, the RAID driver would have kicked it out already.

If you will be replacing sdh, I would try to install the new disk first (I see your box can fit quite a few hard drives) if possible. First add it to your RAID array with mdadm /dev/md0 -a /dev/sd_new_disk. If you look at your /proc/mdstat output, you'll see that the new disk has (S) behind it, which means it's a hot spare. As soon as you flag sdh as failed, the array will start rebuilding immediately onto the spare and you can remove sdh from the server.
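
Roughly, with /dev/sdj1 as a stand-in for whatever device the new disk shows up as:

Code:
mdadm /dev/md0 -a /dev/sdj1        # new disk goes in as a hot spare, shown with (S) in /proc/mdstat
mdadm /dev/md0 --fail /dev/sdh1    # fail the suspect disk; the spare takes over and the rebuild starts
mdadm /dev/md0 --remove /dev/sdh1  # then it can be dropped from the array and pulled from the server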
_________________
Homo sapiens non urinat in ventum