Gentoo Forums
EVMS RAID5 failed--need plenty of help [SOLVED]
jjlawren
n00b


Joined: 06 Feb 2006
Posts: 36

Posted: Wed Jun 28, 2006 3:02 pm    Post subject: EVMS RAID5 failed--need plenty of help [SOLVED]

Apparently my server had errors last night with one of the disks in my 4-disk RAID5 array. Errors like the following were repeated for several minutes (all on sdb):

Jun 28 05:01:45 backend ata2: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04
Jun 28 05:01:45 backend ata2: status=0x51 { DriveReady SeekComplete Error }
Jun 28 05:01:45 backend ata2: error=0x40 { UncorrectableError }
Jun 28 05:01:49 backend ata2: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04
Jun 28 05:01:49 backend ata2: status=0x51 { DriveReady SeekComplete Error }
Jun 28 05:01:49 backend ata2: error=0x40 { UncorrectableError }
Jun 28 05:01:53 backend ata2: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04
Jun 28 05:01:53 backend ata2: status=0x51 { DriveReady SeekComplete Error }
Jun 28 05:01:53 backend ata2: error=0x40 { UncorrectableError }
Jun 28 05:01:56 backend ata2: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04
Jun 28 05:01:56 backend ata2: status=0x51 { DriveReady SeekComplete Error }
Jun 28 05:01:56 backend ata2: error=0x40 { UncorrectableError }
Jun 28 05:02:00 backend ata2: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04
Jun 28 05:02:00 backend ata2: status=0x51 { DriveReady SeekComplete Error }
Jun 28 05:02:00 backend ata2: error=0x40 { UncorrectableError }
Jun 28 05:02:03 backend ata2: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04
Jun 28 05:02:03 backend ata2: status=0x51 { DriveReady SeekComplete Error }
Jun 28 05:02:03 backend ata2: error=0x40 { UncorrectableError }
Jun 28 05:02:03 backend sd 1:0:0:0: SCSI error: return code = 0x8000002
Jun 28 05:02:03 backend sdb: Current: sense key=0x3
Jun 28 05:02:03 backend ASC=0x11 ASCQ=0x4
Jun 28 05:02:03 backend end_request: I/O error, dev sdb, sector 541945335
Jun 28 05:02:49 backend ata2: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04
Jun 28 05:02:49 backend ata2: status=0x51 { DriveReady SeekComplete Error }
Jun 28 05:02:49 backend ata2: error=0x40 { UncorrectableError }
Jun 28 05:02:53 backend ata2: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04
Jun 28 05:02:53 backend ata2: status=0x51 { DriveReady SeekComplete Error }
Jun 28 05:02:53 backend ata2: error=0x40 { UncorrectableError }
Jun 28 05:02:56 backend ata2: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04
Jun 28 05:02:56 backend ata2: status=0x51 { DriveReady SeekComplete Error }
Jun 28 05:02:56 backend ata2: error=0x40 { UncorrectableError }
Jun 28 05:03:00 backend ata2: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04
Jun 28 05:03:00 backend ata2: status=0x51 { DriveReady SeekComplete Error }
Jun 28 05:03:00 backend ata2: error=0x40 { UncorrectableError }
Jun 28 05:03:04 backend ata2: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04
Jun 28 05:03:04 backend ata2: status=0x51 { DriveReady SeekComplete Error }
Jun 28 05:03:04 backend ata2: error=0x40 { UncorrectableError }
Jun 28 05:03:07 backend ata2: translated ATA stat/err 0x51/40 to SCSI SK/ASC/ASCQ 0x3/11/04
Jun 28 05:03:07 backend ata2: status=0x51 { DriveReady SeekComplete Error }
Jun 28 05:03:07 backend ata2: error=0x40 { UncorrectableError }
Jun 28 05:03:07 backend sd 1:0:0:0: SCSI error: return code = 0x8000002
Jun 28 05:03:07 backend sdb: Current: sense key=0x3
Jun 28 05:03:07 backend ASC=0x11 ASCQ=0x4
Jun 28 05:03:07 backend end_request: I/O error, dev sdb, sector 544130991
Jun 28 05:03:08 backend raid5: read error corrected!!

From the system log, it looks like the machine rebooted after these errors. Now EVMS is reporting that the MD superblocks on the first 2 drives (of 4) are not valid and that those drives should be removed from the region. Is there any chance of recovering the array?
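
To see at a glance which device and sectors the kernel gave up on, the log can be filtered for the end_request lines. This is only a rough sketch: the function name summarize_io_errors is made up for the example, and the log path in the usage note is an assumption about where syslog writes on this machine.

```shell
#!/bin/sh
# Count kernel "end_request: I/O error" lines per device and list the sectors.
# Reads syslog text on stdin; the function name is hypothetical.
summarize_io_errors() {
    awk '
        /end_request: I\/O error, dev / {
            # Find the "dev" and "sector" keywords rather than relying on fixed columns.
            for (i = 1; i < NF; i++) {
                if ($i == "dev")    { dev = $(i + 1); sub(/,$/, "", dev) }
                if ($i == "sector") { sec = $(i + 1) }
            }
            count[dev]++
            sectors[dev] = sectors[dev] " " sec
        }
        END {
            for (d in count)
                printf "%s: %d failed request(s), sectors:%s\n", d, count[d], sectors[d]
        }
    '
}
```

Something like `summarize_io_errors < /var/log/messages` would run it against the live log (path assumed). The ata2 status lines are ignored on purpose; only requests that the block layer actually failed are counted.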


Last edited by jjlawren on Thu Jun 29, 2006 1:39 pm; edited 1 time in total
jjlawren
n00b


Joined: 06 Feb 2006
Posts: 36

Posted: Wed Jun 28, 2006 3:41 pm    Post subject:

Can I use mdadm in some way? I assume EVMS is just a wrapper around the kernel's MD support.

LVM2 is also in use on top of the array, but I assume that if I can solve the superblock problem, that won't be as big of a problem. EVMS does show the LVM volumes, but cannot activate the array.
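
Before changing anything, mdadm can at least be used to look at what each member still records. A minimal sketch, assuming the members are the /dev/mapper/sdX1 nodes that EVMS exposes (the sdb1 path in particular is a guess by analogy with the others); run and examine_members are names invented for the example. With DRY_RUN left at its default of 1 the commands are only printed.

```shell
#!/bin/sh
# Print (DRY_RUN=1, the default) or execute (DRY_RUN=0) a command.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Dump the MD superblock of each member device. Where a superblock is still
# readable, the output includes the RAID level, chunk size, layout, and event
# counter, which can then be compared across members by eye.
examine_members() {
    for dev in "$@"; do
        run mdadm --examine "$dev"
    done
}

# Example call (device paths are assumptions):
# examine_members /dev/mapper/sda1 /dev/mapper/sdb1 /dev/mapper/sdc1 /dev/mapper/sdd1
```

mdadm --examine only reads from the devices, so it should be safe to run even while the state of the array is unclear.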
jjlawren
n00b


Joined: 06 Feb 2006
Posts: 36

Posted: Thu Jun 29, 2006 1:39 pm    Post subject:

Fixed with the following command:

mdadm -C /dev/md0 --level=5 --chunk=128 --layout=la --raid-devices=4 /dev/mapper/sda1 missing /dev/mapper/sdc1 /dev/mapper/sdd1

Since one disk was marked as missing, the array came up degraded and did not attempt to resync, which effectively kept it read-only (unless I made changes to it myself). I ran fsck on each of the LVM2 logical volumes that existed on the array and everything looked fine. I then added sdb1 back into the array as a spare and let it sync.

Unfortunately it looks like sdb is actually bad, and it was removed a few hours later when it failed. But my data is intact and a replacement drive is on the way.
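
For anyone who finds this thread later, the sequence described above can be sketched roughly as follows. Treat it as an outline under assumptions, not a tested recipe: the level, chunk size, layout, and device order must match how the array was originally built (the values here are the ones from the command above), the logical volume paths passed to check_volumes are placeholders, and fsck -n (check without writing) is swapped in as a cautious choice rather than the exact options that were used. Activating the LVM volumes between steps 1 and 2 is left out. With DRY_RUN at its default of 1 nothing is executed, the commands are only printed.

```shell
#!/bin/sh
# Print (DRY_RUN=1, the default) or execute (DRY_RUN=0) a command.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Step 1: recreate the array degraded. "missing" holds the slot of the suspect
# disk, so no resync starts and the data on the other members is left as-is.
recreate_degraded() {
    run mdadm -C /dev/md0 --level=5 --chunk=128 --layout=la --raid-devices=4 \
        /dev/mapper/sda1 missing /dev/mapper/sdc1 /dev/mapper/sdd1
}

# Step 2: check each logical volume without letting fsck modify it.
# The volume paths are supplied by the caller (placeholders in this sketch).
check_volumes() {
    for lv in "$@"; do
        run fsck -n "$lv"
    done
}

# Step 3: add the previously failed member back so the kernel rebuilds onto it,
# then show the rebuild status.
readd_member() {
    run mdadm /dev/md0 --add "$1"
    run cat /proc/mdstat
}
```

Only move on to step 3 once the filesystems check out clean; if the recreated geometry were wrong, rebuilding onto the fourth disk would write parity computed from the wrong layout.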
All times are GMT