Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
Dell CERC SATA 1.5/6xh Raid controller, reiser bug [SOLVED].
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware
View previous topic :: View next topic  
Author Message
r3tude
n00b
n00b


Joined: 12 Jan 2005
Posts: 18

PostPosted: Mon May 15, 2006 11:00 am    Post subject: Dell CERC SATA 1.5/6xh Raid controller, reiser bug [SOLVED]. Reply with quote

Hi all

Some history we bought this server last year, with 794GB Raid array to take over from out PDC's fileserver roll. The configuration was easy enough got us a good reliable fileserver for a week then all went wrong and dell came out. Since then we have had 3 Drive failures one of the a dual failure losing lots of data, dell replaced the drives. Now I am getting drive failures on what I can see are working drives. Theyre Maxtor drives which may explain it, but i run maxtor's utility brun in test and they work fine, but in this server one fails and hotspare takes over then the hotpsare fails and the buggered one takes over and it keeps going in the loop.

The main thing is that every time a drive fails in this raid 5 array the whole server stops responding and goes down, its usually fine after a hard reboot and caries on rebuilding the array but today ive had to do a --rebuild-tree to fix it.

so the main question is, has anyone had experience with this raid card under gentoo and have you had any problems. I am thinking its more to do with it being Dell and maxtor hardware not what i would class good by anystandards, but i need to cover all avenues


Last edited by r3tude on Fri May 19, 2006 2:30 pm; edited 1 time in total
Back to top
View user's profile Send private message
HackingM2
Apprentice
Apprentice


Joined: 26 Jul 2004
Posts: 245
Location: Cambridge, England

PostPosted: Wed May 17, 2006 9:30 am    Post subject: Reply with quote

I have had similar problems on a server I own. It used to do pretty much what you described. It wasn't a Dell though.

What fixed the problems in the end was a new PSU. Turned out that the old one had an under-voltage 12v rail which was making the drives behave like they had failed. Swapped it out for a better model (650w tripple redundant) and all has been well since - the server sounds like an aircraft taking off but it works. :)

You may want to invest in a half-decent digital multi-meter and see what it says about that. I know what Dell support techs are like and I can't imagine them checking it. Try loading the system with lots of CPU and disk activity while you test. Mine used to drop to about 10.2v. :roll:
Back to top
View user's profile Send private message
r3tude
n00b
n00b


Joined: 12 Jan 2005
Posts: 18

PostPosted: Thu May 18, 2006 9:18 am    Post subject: Reply with quote

thanks for the info, I'll dig out my multimeter now

I never thought of the PSU, it makes sense.
Back to top
View user's profile Send private message
r3tude
n00b
n00b


Joined: 12 Jan 2005
Posts: 18

PostPosted: Fri May 19, 2006 2:29 pm    Post subject: Reply with quote

Sorted there was a bug in reiserfs affecting large filesystems, it was documented on a 1.4TB filesystem here http://www.mail-archive.com/reiserfs-list@namesys.com/msg20923.html.

It was causing massive server load and read write failures, i've upgraded my system and done a kernel upgrade and it seems fine now, I am going to keep an eye on the raid array just encase this was a secondary problem.
Back to top
View user's profile Send private message
HackingM2
Apprentice
Apprentice


Joined: 26 Jul 2004
Posts: 245
Location: Cambridge, England

PostPosted: Fri May 19, 2006 3:21 pm    Post subject: Reply with quote

Interesting. I shall have to watch out for that as I have some filesystems approaching that size.

I don't want to put a dampner on things but I have to say that if the controller is deciding that the drive is failed (and switching to a hot-spare) I would be very surprised indeed if it was a software issue. In my experience these things always turn out to be hardware - usually PSU or RAM.

Still... Glad to hear it is working now. I hope it continues to do so. :)
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum