Gentoo Forums
System crashing, kernel errors, etc. Need help.
lmegliol
n00b


Joined: 12 Sep 2005
Posts: 68

Posted: Tue Dec 11, 2007 3:09 am    Post subject: System crashing, kernel errors, etc. Need help.

A server of mine has been running well for a long time. I have not made any changes to this server recently, but yesterday it crashed. Upon rebooting the system, it crashed again in short order.

I have run memtest86 on the system and after three passes it showed no errors. Assuming that the memory is OK, I have no idea what the problem is.

I had netconsole set up to send output from the console to another system, but for some reason that is no longer working. Since I am nowhere near this system, I have no other easy way to see the errors, so I had someone send me digital photos of the screen when it crashes.

Can anyone take a look at these and make a few suggestions?

Thanks in advance.

http://img257.imageshack.us/img257/1822/96480978dv1.th.jpg

http://img257.imageshack.us/img257/3649/76587291zi9.th.jpg

http://img504.imageshack.us/img504/9606/62655257bt7.th.jpg

http://img257.imageshack.us/img257/1636/27665837so3.th.jpg

http://img504.imageshack.us/img504/5471/dsc05254ld4.th.jpg
djinnZ
Advocate


Joined: 02 Nov 2006
Posts: 4831
Location: somewhere in L.O.S.

Posted: Tue Dec 11, 2007 10:50 am

On some of my old PCs the boot would fail because the IDE controller or the network card was damaged.
You can also measure the PSU's output voltages with a multimeter. After a long time, some poor-quality power supplies need to be replaced.
_________________
scita et risus abundant in ore stultorum sed etiam semper severi insani sunt:wink:
mala tempora currunt...mater stultorum semper pregna est :evil:
Murpy'sLaw:If anything can go wrong, it will - O'Toole's Corollary:Murphy was an optimist :wink:
Dagger
Retired Dev


Joined: 11 Jun 2003
Posts: 765
Location: UK

Posted: Tue Dec 11, 2007 12:06 pm

I would recommend testing your memory (memtest86) a bit more.
_________________
95% of all computer errors occur between chair and keyboard (TM)
Join the FSF as an Associate Member!
Post under CC license.


Last edited by Dagger on Tue Dec 11, 2007 1:42 pm; edited 1 time in total
energyman76b
Advocate


Joined: 26 Mar 2003
Posts: 2048
Location: Germany

Posted: Tue Dec 11, 2007 1:19 pm

Three passes are not enough.

I would go down the PSU route too.

Oh, and when you are replacing stuff, try replacing the kernel too. 2.6.17 is ancient, and lots of bugs and holes have been fixed since...
_________________
Study finds stunning lack of racial, gender, and economic diversity among middle-class white males

I identify as a dirty penismensch.
lmegliol
n00b


Joined: 12 Sep 2005
Posts: 68

Posted: Thu Dec 13, 2007 11:40 pm

Exactly how many passes am I looking for on the memtest? Last time I checked it was at over 12 with zero errors.

The PSU? Really? Can someone explain the logic behind this one, because apparently my imagination is running short today?

As far as the kernel goes, yeah I know it is old. But unless there is a hardware problem, would an older kernel that has exhibited no problems since installation just suddenly start causing problems?

What about a CPU failure? Does this sound like something that could be the cause?
energyman76b
Advocate


Joined: 26 Mar 2003
Posts: 2048
Location: Germany

Posted: Fri Dec 14, 2007 1:04 am

lmegliol wrote:
Exactly how many passes am I looking for on the memtest? Last time I checked it was at over 12 with zero errors.

The PSU? Really? Can someone explain the logic behind this one, because apparently my imagination is running short today?

As far as the kernel goes, yeah I know it is old. But unless there is a hardware problem, would an older kernel that has exhibited no problems since installation just suddenly start causing problems?

What about a CPU failure? Does this sound like something that could be the cause?


12 memtest passes are OK, 20 are better, 100... you can't have enough, because sometimes even after 1000 passes an error might slip through ;)
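
To put rough numbers on that, here is a small Python sketch. It assumes each pass independently catches an intermittent fault with some fixed probability; that model and the 5% figure used below are made up for illustration, not measured values for memtest86. The function names are hypothetical too.

```python
import math


def chance_all_passes_clean(p_detect: float, passes: int) -> float:
    """Probability that a faulty module shows zero errors in `passes` passes.

    Assumes every pass catches the fault independently with probability
    `p_detect` (a simplifying assumption, not a property of memtest86).
    """
    return (1.0 - p_detect) ** passes


def passes_needed(p_detect: float, confidence: float) -> int:
    """Smallest number of passes that catches the fault with at least
    `confidence` probability, under the same independence assumption."""
    if not 0.0 < p_detect < 1.0 or not 0.0 < confidence < 1.0:
        raise ValueError("p_detect and confidence must be strictly between 0 and 1")
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - p_detect))


if __name__ == "__main__":
    p = 0.05  # hypothetical per-pass detection chance for a flaky module
    print(f"12 clean passes still miss the fault {chance_all_passes_clean(p, 12):.0%} of the time")
    print(f"passes needed for 95% confidence: {passes_needed(p, 0.95)}")
```

With that made-up 5% per pass, 12 clean passes would still miss the fault a bit more than half the time, and it would take 59 passes to reach 95% confidence. That is why a clean run only lowers the odds of bad RAM and never rules it out.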

The PSU: it contains capacitors (caps), and caps weaken over time, the warmer they run the faster (they lose some of their capacitance). This results in fluctuating voltages, voltages that are too low, or a PSU that can no longer supply enough current. If that happens, your computer becomes extremely crashy, because delicate electronics don't stomach jumping voltages, voltages dropping below tolerance, or voltages going too high to compensate for the missing current.

I had several cases where a weak PSU was the cause of a lot of problems. New PSU, problems gone.
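
If someone on site can take multimeter readings, a quick way to judge them is to compare each rail against its allowed range. The Python sketch below does that. The tolerances (5% on the positive rails, 10% on -12 V) are the figures I know from the ATX12V design guide, not something taken from this particular PSU, so treat them as an assumption and check the PSU's own documentation. The names `ATX_RAILS` and `check_rails` are just for illustration.

```python
from typing import Dict

# Nominal voltage and allowed relative deviation per rail.
# Assumption: tolerances as in the ATX12V design guide.
ATX_RAILS = {
    "+3.3V": (3.3, 0.05),
    "+5V": (5.0, 0.05),
    "+12V": (12.0, 0.05),
    "-12V": (-12.0, 0.10),
}


def check_rails(measured: Dict[str, float]) -> Dict[str, str]:
    """Classify each measured rail as 'low', 'high', 'ok' or 'unknown'."""
    results = {}
    for rail, volts in measured.items():
        if rail not in ATX_RAILS:
            results[rail] = "unknown"
            continue
        nominal, tolerance = ATX_RAILS[rail]
        margin = abs(nominal) * tolerance
        if volts < nominal - margin:
            results[rail] = "low"
        elif volts > nominal + margin:
            results[rail] = "high"
        else:
            results[rail] = "ok"
    return results


if __name__ == "__main__":
    # Example readings, invented for illustration.
    readings = {"+12V": 11.2, "+5V": 5.1, "+3.3V": 3.3}
    for rail, verdict in check_rails(readings).items():
        print(rail, verdict)
```

Keep in mind that a multimeter only shows an averaged value. Readings taken at idle can look fine while the rails sag or ripple under load, so an "ok" here does not clear the PSU.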

CPU failure is extremely rare (except when you are overclocking an Intel P4). And CPU failure is usually catastrophic...
All times are GMT