Gentoo Forums
Recurring kernel panic, VFS not syncing, I have the error

Thisp
n00b


Joined: 29 Jul 2006
Posts: 46

Posted: Fri Feb 02, 2007 7:13 am    Post subject: Recurring kernel panic, VFS not syncing, I have the error

Code:
Hardware error
CPU0: Machine check exception
4 bank 4: 48587524039(bunch of crap)
T = C43294234729083(more crap)

This is not a software problem
Run through mcelog --ascii to decode and contact your hardware vendor
Kernel panic - not syncing: Machine check


That was it.
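
For anyone trying to read a panic like the one above by hand, here is a minimal Python sketch of what the numbers mean. It assumes the one-line form the x86_64 kernel of that era normally prints ("CPU n: Machine Check Exception: <MCG status> Bank <n>: <bank status>"), which is not quite how it was typed above. The example line and its hex value are made up for illustration, since the real values were not recorded in the post. The flag bits are the architectural MCi_STATUS bits; on AMD K8 parts, bank 4 is usually the northbridge (memory controller / HyperTransport), but treat that as something to verify against the CPU documentation.

```python
import re

# Architectural flag bits of the x86 MCi_STATUS register (bit -> meaning).
STATUS_FLAGS = {
    63: "VAL (status register holds a valid error)",
    62: "OVER (an earlier error was overwritten)",
    61: "UC (uncorrected error)",
    60: "EN (reporting was enabled for this error)",
    59: "MISCV (MISC register is valid)",
    58: "ADDRV (ADDR register is valid)",
    57: "PCC (processor context corrupt)",
}

# Assumed one-line format of the kernel message; case-insensitive on purpose.
MCE_LINE = re.compile(
    r"CPU\s*(?P<cpu>\d+):\s*Machine Check Exception:\s*(?P<mcg>[0-9a-f]+)\s+"
    r"Bank\s+(?P<bank>\d+):\s*(?P<status>[0-9a-f]{1,16})\b",
    re.IGNORECASE,
)


def parse_mce_line(line):
    """Return CPU, MCG status, bank and decoded flags, or None if no match."""
    m = MCE_LINE.search(line)
    if m is None:
        return None
    status = int(m.group("status"), 16)
    flags = [
        name
        for bit, name in sorted(STATUS_FLAGS.items(), reverse=True)
        if (status >> bit) & 1
    ]
    return {
        "cpu": int(m.group("cpu")),
        "mcg_status": int(m.group("mcg"), 16),
        "bank": int(m.group("bank")),
        "status": status,
        "flags": flags,
    }


# Hypothetical line, NOT the value from the panic in this thread.
EXAMPLE = "CPU 0: Machine Check Exception: 4 Bank 4: b200000000070f0f"

if __name__ == "__main__":
    info = parse_mce_line(EXAMPLE)
    print("cpu", info["cpu"], "bank", info["bank"])
    for flag in info["flags"]:
        print(" ", flag)
```

This only decodes the generic flag bits. The low bits of the status word are vendor-specific error codes, which is the part mcelog --ascii is meant to decode.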

The thing's folded for over a week, and it was just running memtest like a champ. The HD wasn't clicking.

This has been a recurring error, maybe every few weeks to a month, since I set up this router/fileserver. It's either "kernel panic - VFS not syncing" or "kernel panic - not syncing", but this was the only one that wasn't fixed by a reboot. I reset the BIOS to failsafe defaults and only now does it get in line and work, but I also opened the case to check connections, so I'm not sure which one fixed it.

I was wondering if anyone could give me any insight as to where the issue is. I'm out of ideas. If it can fold for over a week, and run dnetc/RC5-72 for over two weeks before I stop it myself, then I don't understand why it does this. I tried swapping out PSUs and CPUs (this happened on a Venice 3000+ and an Opteron 165, with an Antec TruePower 2.0 550 and a Sparkle FSP550). I'm pretty certain it's not a hardware error, because if it was I would have seen this when I run Windows stuff on it.

This has also happened with the OS on two different physical drives, same thing.


Last edited by Thisp on Fri Feb 02, 2007 7:24 am; edited 1 time in total

bunder
Bodhisattva


Joined: 10 Apr 2004
Posts: 5947

Posted: Fri Feb 02, 2007 7:23 am

tried turning off MCE in the kernel?
_________________
Neddyseagoon wrote:
The problem with leaving is that you can only do it once and it reduces your influence.

banned from #gentoo since sept 2017

Thisp
n00b


Joined: 29 Jul 2006
Posts: 46

Posted: Fri Feb 02, 2007 7:26 am

bunder wrote:
tried turning off MCE in the kernel?


Honestly - I don't even know what MCE is. I've barely used Linux before this. I jumped straight into Gentoo and tried compiling kernels and configuring a router, LVM and RAID without any practical Linux experience, learning with the help of google.com/linux and IRC as I went along... so I'm unfamiliar with a lot of this.

I will search for that option tonight, compile a separate kernel and keep the old one in case something goes wrong, and see how that works.

Thanks a lot for your suggestion. :)

bunder
Bodhisattva


Joined: 10 Apr 2004
Posts: 5947

Posted: Fri Feb 02, 2007 7:27 am

It's called Machine Check Exception, and it's on the same page where you choose your CPU type.

cheers
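
To see whether a given kernel was built with that option, one rough approach is to look for the CONFIG_X86_MCE symbol in its .config. The sketch below is only an illustration: the helper name mce_setting and the sample text are made up, and it checks config text passed in as a string instead of any particular file.

```python
def mce_setting(config_text):
    """Return the value of CONFIG_X86_MCE ('y', 'n', ...) or None if absent."""
    for line in config_text.splitlines():
        line = line.strip()
        # Disabled options appear as a comment in kernel .config files.
        if line == "# CONFIG_X86_MCE is not set":
            return "n"
        # The "=" keeps CONFIG_X86_MCE_INTEL / _AMD from matching here.
        if line.startswith("CONFIG_X86_MCE="):
            return line.split("=", 1)[1]
    return None


# Hypothetical excerpt of a kernel .config, for illustration only.
SAMPLE = """\
CONFIG_X86_MCE=y
CONFIG_X86_MCE_INTEL=y
# CONFIG_X86_MCE_AMD is not set
"""

if __name__ == "__main__":
    print(mce_setting(SAMPLE))  # prints: y
```

On a Gentoo box the text would typically come from something like /usr/src/linux/.config (an assumption about where the sources live). For a quick test without rebuilding, 2.6-era kernels also document the boot parameters mce=off (x86_64) and nomce (32-bit x86); check the Documentation/ directory of your kernel version before relying on either.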

Thisp
n00b


Joined: 29 Jul 2006
Posts: 46

Posted: Fri Feb 02, 2007 7:34 am

I misspoke - I know where it is, since I recently learned how to search the kernel config. I just have no idea what machine check support is, what it does or is used for, or why it would cause kernel crashes.

Thanks again for your help.

bunder
Bodhisattva


Joined: 10 Apr 2004
Posts: 5947

Posted: Fri Feb 02, 2007 7:43 am

wikipedia wrote:
A Machine Check Exception is a hardware error which occurs when a computer processor detects an unrecoverable hardware problem.

The error is usually due to failure or overstressing of hardware components where the error cannot be more specifically identified with another error message. Diagnosing the error message can be difficult, although Intel Pentium processors do generate more specific codes which can be decoded by contacting the manufacturer.

MCE's require a restart to continue and often indicate a long term general problem.

Most of these errors are specific to the Pentium processor family, similar errors may occur on other processors and will cause the same problems.

Here are some of the main hardware problems that cause MCE's:

* System bus errors (error communicating between the processor and the motherboard)
* Memory errors that may include parity / Error correction code (ECC) problems. Error checking ensures that data is stored correctly in the RAM, if information is corrupted then random errors occur.
* Cache errors in the processor, the cache stores important data and code. If this is corrupted errors often occur


but since you checked your RAM and CPU, it may just be that the code is flaky on your particular system.

Thisp
n00b


Joined: 29 Jul 2006
Posts: 46

Posted: Fri Feb 02, 2007 8:02 am

This has happened with enough different components for me to say bunk on MCE. I know Opterons are unsupported in Socket 939 boards, but the Venice 3000+ is supported, and it happened with that too. This has happened with too many different hardware combinations.

Thanks again for all your help - this forum is an awesome resource.

Thisp
n00b


Joined: 29 Jul 2006
Posts: 46

Posted: Fri Feb 02, 2007 9:01 pm

Hmm. It happened again today, even with the new kernel. Kernel panic, VFS not syncing, and I had to reboot to fix it. :(

All times are GMT