View previous topic :: View next topic |
Author |
Message |
dendenners n00b
![n00b n00b](/images/ranks/rank_rect_0.gif)
![](images/avatars/gallery/Futurama/cartoon_futurama_zapp_brannigan.gif)
Joined: 17 Oct 2003 Posts: 28 Location: Inside a shirt
|
Posted: Fri Nov 28, 2003 9:51 am Post subject: hard drive related reiserfs crash |
|
|
Hi,
I have a problem with a machine running gentoo. Every so often (roughly every 24 hours or so) the machine stops responding to commands (for example when I run programs I often get errors like 'I/O error' or errors relating to the bus) and in the kernel logs I get messages like
end_request: I/O error, cmd 0 dev 03:08(hda) sector 183624
zam_7001: io error in reiserfs_find_entry
The only way I can restart the machine when this happens is to hard restart it.
A number of different sectors come up in the error message at different times. I ran fsck on all the partitions I have, but fsck reckons everything is OK. I am running oracle 9 on the machine - this may have nothing to do with the issue, but I can't recall the problem occuring when oracle was not running on the box. What could be the problem here? Is the hardware dodgy (as I suspect) or are oracle's onerous memory requirements leading to unforseen errors elsewhere? I'd be more than grateful for any pointers
Regards
Denis |
|
Back to top |
|
![](templates/gentoo/images/spacer.gif) |
Janne Pikkarainen Veteran
![Veteran Veteran](/images/ranks/rank_rect_5_vet.gif)
![](images/avatars/10433783463f526aba4144d.jpg)
Joined: 29 Jul 2003 Posts: 1143 Location: Helsinki, Finland
|
Posted: Fri Nov 28, 2003 12:02 pm Post subject: |
|
|
Sounds like a soon-to-be-dead HD. If I were you, I would replace the HD as soon as possible. _________________ Yes, I'm the man. Now it's your turn to decide if I meant "Yes, I'm the male." or "Yes, I am the Unix Manual Page.". |
|
Back to top |
|
![](templates/gentoo/images/spacer.gif) |
perm n00b
![n00b n00b](/images/ranks/rank_rect_0.gif)
![](images/avatars/5043483764231ae879a143.jpg)
Joined: 30 Oct 2003 Posts: 1
|
Posted: Wed Dec 03, 2003 6:05 pm Post subject: |
|
|
You could try to run one of the S.M.A.R.T. tools (emerge ide-smart) and see if any of its tests fails. If it does, you should already have made a backup and ordered a new drive. Alternatively, tar down everything to somewhere else, and reformat the partition in question to e.g. ext3. This doesn't really help if you've got a bad drive, but it might if you've run into some weird reiserfs-related problem.
On second thought, you don't have some cron-job giving the the drive a workout once a day, so that it gets too hot? |
|
Back to top |
|
![](templates/gentoo/images/spacer.gif) |
dendenners n00b
![n00b n00b](/images/ranks/rank_rect_0.gif)
![](images/avatars/gallery/Futurama/cartoon_futurama_zapp_brannigan.gif)
Joined: 17 Oct 2003 Posts: 28 Location: Inside a shirt
|
Posted: Wed Jan 21, 2004 11:25 pm Post subject: |
|
|
OK guys, I've got more info on this. The following data is printed to the console just as the box abends:
Code: |
reiserfs: journal-837: IO error during journal replay
journal-712: buffer write failed
kernel BUG at prints.c:334!
invalid operand: 000
CPU: 0
EIP: 0020:[<c0223acf>] Tainted: GF
EFLAGS: 00010282
eax: 00000024 ebx: df928600 ecx: 00000001 edx: dbe32000
esi: 00000007 edi: e09755fc ebp: 00000374 esp: d3a4be14
ds: 0018 es: 0018 ss: 0018
Process netstat-bf.sh (pid: 4505, stackpage=d3a4b000)
Stack: c031fc99 c015a200 c032dce0 d3a4be34 df928600 c022b71d df928600 c032dce0
e09755fc c022ba50 df928600 0000160f 00007d3d df928600 00000282 00000000
e09755fc 00000374 00000374 00000374 c022ec9a df928600 e09755fc 00000001
Call Trace: [<c022b71d>] [<c022ba50>] [<c022ec9a>] [<c022da4b>] [<c02165ea>]
[<c02b86ca>] [<c01d89a8>] [<c01d8feb>] [<c01cefc2>] [<c01ce90d>] [c019eb73>]
Code: 0f 0b 4e 01 32 32 32 c0 85 db 68 00 a2 15 c0 74 10 0f b7 43
end_request I/O error, cmd 0 dev 03:09 (hda) sector 66648
zam-7002: io error in reiserfs_find_entry
end_request I/O error, cmd 0 dev 03:09 (hda) sector 66648
zam-7002: io error in reiserfs_find_entry
end_request I/O error, cmd 0 dev 03:09 (hda) sector 18616320
end_request I/O error, cmd 0 dev 03:09 (hda) sector 18616320
|
This looks kind of fishy, especially the kernel BUG thing. I was wondering whether it could possibly be due to me having quite a large hard drive on this machine (200G) and reiserfs encountering some addressing problem referencing high parts of the drive (just a thought). Anyway, again any feedback would be greatly appreciated, as I am having to defend the robustness of Linux to the Windows heathen that surround me.
TIA
Denis |
|
Back to top |
|
![](templates/gentoo/images/spacer.gif) |
|
|
You cannot post new topics in this forum You cannot reply to topics in this forum You cannot edit your posts in this forum You cannot delete your posts in this forum You cannot vote in polls in this forum
|
|