Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
Harddisk hangup
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware
View previous topic :: View next topic  
Author Message
Psi15
Tux's lil' helper
Tux's lil' helper


Joined: 07 Jan 2003
Posts: 86
Location: Vienna

PostPosted: Thu May 19, 2005 5:56 am    Post subject: Harddisk hangup Reply with quote

Hi Guys!

Very strange things happening over here. My local fileserver is up several days without any troubles, but suddenly one of the disks is simply gone (it is not that much of a problem because the disks are configured in a raid device). This is what the kern.log sais about the disk

Code:

May 18 16:14:24 psi15 kernel: hda: dma_timer_expiry: dma status == 0x60
May 18 16:14:24 psi15 kernel: hda: DMA timeout retry
May 18 16:14:24 psi15 kernel: hda: timeout waiting for DMA
May 18 16:14:24 psi15 kernel: hda: status timeout: status=0xd0 { Busy }
May 18 16:14:24 psi15 kernel:
May 18 16:14:24 psi15 kernel: hdb: DMA disabled
May 18 16:14:24 psi15 kernel: hda: drive not ready for command
May 18 16:14:54 psi15 kernel: ide0: reset timed-out, status=0x80
May 18 16:15:24 psi15 kernel: hda: status timeout: status=0x80 { Busy }
May 18 16:15:24 psi15 kernel:
May 18 16:15:24 psi15 kernel: hda: drive not ready for command
May 18 16:15:24 psi15 kernel: ide0: reset timed-out, status=0x80
May 18 16:15:24 psi15 kernel: blk: queue dfe0e400, I/O limit 4095Mb (mask 0xffffffff)
May 18 16:15:24 psi15 kernel: end_request: I/O error, dev hda, sector 20435526
May 18 16:15:24 psi15 kernel: raid1: Disk failure on hda3, disabling device.
May 18 16:15:24 psi15 kernel: ^IOperation continuing on 1 devices
May 18 16:15:24 psi15 kernel: end_request: I/O error, dev hda, sector 246109
May 18 16:15:24 psi15 kernel: Buffer I/O error on device hda2, logical block 4658
May 18 16:15:24 psi15 kernel: lost page write due to I/O error on hda2
May 18 16:15:24 psi15 kernel: end_request: I/O error, dev hda, sector 246117
May 18 16:15:24 psi15 kernel: Buffer I/O error on device hda2, logical block 4659
May 18 16:15:24 psi15 kernel: lost page write due to I/O error on hda2
May 18 16:15:24 psi15 kernel: end_request: I/O error, dev hda, sector 222437
May 18 16:15:24 psi15 kernel: end_request: I/O error, dev hda, sector 222445
May 18 16:15:24 psi15 kernel: end_request: I/O error, dev hda, sector 222453
May 18 16:15:24 psi15 kernel: end_request: I/O error, dev hda, sector 222461
May 18 16:15:24 psi15 kernel: end_request: I/O error, dev hda, sector 222469
May 18 16:15:24 psi15 kernel: end_request: I/O error, dev hda, sector 222477
May 18 16:15:24 psi15 kernel: end_request: I/O error, dev hda, sector 222485
May 18 16:15:24 psi15 kernel: Buffer I/O error on device hda2, logical block 1705
May 18 16:15:24 psi15 kernel: lost page write due to I/O error on hda2
May 18 16:15:24 psi15 kernel: Aborting journal on device hda2.
May 18 16:15:24 psi15 kernel: RAID1 conf printout:
May 18 16:15:24 psi15 kernel:  --- wd:1 rd:2
May 18 16:15:24 psi15 kernel:  disk 0, wo:1, o:0, dev:hda3
May 18 16:15:24 psi15 kernel:  disk 1, wo:0, o:1, dev:hdc2
May 18 16:15:24 psi15 kernel: RAID1 conf printout:
May 18 16:15:24 psi15 kernel:  --- wd:1 rd:2
May 18 16:15:24 psi15 kernel:  disk 1, wo:0, o:1, dev:hdc2
May 18 16:15:54 psi15 kernel: end_request: I/O error, dev hda, sector 212933
May 18 16:15:54 psi15 kernel: Buffer I/O error on device hda2, logical block 511
May 18 16:15:54 psi15 kernel: lost page write due to I/O error on hda2
May 18 16:15:54 psi15 kernel: end_request: I/O error, dev hda, sector 208877
May 18 16:15:54 psi15 kernel: Buffer I/O error on device hda2, logical block 4
May 18 16:15:54 psi15 kernel: lost page write due to I/O error on hda2
May 18 16:15:54 psi15 kernel: end_request: I/O error, dev hda, sector 208861
May 18 16:15:54 psi15 kernel: Buffer I/O error on device hda2, logical block 2
May 18 16:15:54 psi15 kernel: lost page write due to I/O error on hda2
May 18 16:15:54 psi15 kernel: end_request: I/O error, dev hda, sector 208845
May 18 16:15:54 psi15 kernel: Buffer I/O error on device hda2, logical block 0
May 18 16:15:54 psi15 kernel: lost page write due to I/O error on hda2
May 18 16:15:54 psi15 kernel: end_request: I/O error, dev hda, sector 208853
May 18 16:15:54 psi15 kernel: Buffer I/O error on device hda2, logical block 1
May 18 16:15:54 psi15 kernel: lost page write due to I/O error on hda2
May 18 16:15:54 psi15 kernel: end_request: I/O error, dev hda, sector 208869
May 18 16:15:54 psi15 kernel: Buffer I/O error on device hda2, logical block 3
May 18 16:15:54 psi15 kernel: lost page write due to I/O error on hda2
May 18 16:17:51 psi15 kernel: ext3_abort called.
May 18 16:17:51 psi15 kernel: EXT3-fs abort (device hda2): ext3_journal_start: Detected aborted journal
May 18 16:17:51 psi15 kernel: Remounting filesystem read-only
May 18 16:20:01 psi15 kernel: end_request: I/O error, dev hda, sector 212933
May 19 00:16:09 psi15 kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000068
May 19 00:16:09 psi15 kernel:  printing eip:
May 19 00:16:09 psi15 kernel: c0143025
May 19 00:16:09 psi15 kernel: *pde = 00000000
May 19 00:16:09 psi15 kernel: Oops: 0002 [#1]
May 19 00:16:09 psi15 kernel: CPU:    0
May 19 00:16:09 psi15 kernel: EIP:    0060:[<c0143025>]    Tainted: P 
May 19 00:16:09 psi15 kernel: EFLAGS: 00010202
May 19 00:16:09 psi15 kernel: EIP is at drop_buffers+0x35/0xd0
May 19 00:16:09 psi15 kernel: eax: 00000000   ebx: c3d46b70   ecx: c3d46b70   edx: c10a2ad0
May 19 00:16:09 psi15 kernel: esi: 00000001   edi: c10a2ad0   ebp: c3d46b70   esp: dfe39d68
May 19 00:16:09 psi15 kernel: ds: 007b   es: 007b   ss: 0068
May 19 00:16:09 psi15 kernel: Process kswapd0 (pid: 8, threadinfo=dfe38000 task=dfe3ece0)
May 19 00:16:09 psi15 kernel: Stack: c10a2ad0 c10a2ad0 00000001 00000000 c01430ff c10a2ad0 dfe39d8c 00000001
May 19 00:16:09 psi15 kernel:        c10a2ad0 00000000 c01414f6 c10a2ad0 c0130869 c10a2ad0 000000d0 dfe39e68
May 19 00:16:09 psi15 kernel:        c02db53c c02db55c 000000bb c10a2ad0 00000008 00000000 dfe39dc0 dfe39dc0
May 19 00:16:09 psi15 kernel: Call Trace:
May 19 00:16:09 psi15 kernel:  [<c01430ff>] try_to_free_buffers+0x3f/0xa0
May 19 00:16:09 psi15 kernel:  [<c01414f6>] try_to_release_page+0x46/0x50
May 19 00:16:09 psi15 kernel:  [<c0130869>] shrink_list+0x2e9/0x470
May 19 00:16:09 psi15 kernel:  [<c0109f31>] handle_IRQ_event+0x31/0x60
May 19 00:16:09 psi15 kernel:  [<c0130b5e>] shrink_cache+0x16e/0x260
May 19 00:16:09 psi15 kernel:  [<c0131170>] shrink_zone+0x70/0x80
May 19 00:16:09 psi15 kernel:  [<c0131495>] balance_pgdat+0x105/0x1d0
May 19 00:16:09 psi15 kernel:  [<c0131640>] kswapd+0xe0/0xf0
May 19 00:16:09 psi15 kernel:  [<c0131560>] kswapd+0x0/0xf0
May 19 00:16:09 psi15 kernel:  [<c0113690>] autoremove_wake_function+0x0/0x40
May 19 00:16:09 psi15 kernel:  [<c0113690>] autoremove_wake_function+0x0/0x40
May 19 00:16:09 psi15 kernel:  [<c0106e9d>] kernel_thread_helper+0x5/0x18
May 19 00:16:09 psi15 kernel:
May 19 00:16:09 psi15 kernel: Code: 0f ba 68 68 10 8b 13 8b 43 04 83 e2 06 09 d0 75 7a 8b 13 83
May 19 00:16:32 psi15 kernel: end_request: I/O error, dev hda, sector 21579439
May 19 00:16:32 psi15 kernel: raid1: Disk failure on hda4, disabling device.
May 19 00:16:32 psi15 kernel: ^IOperation continuing on 1 devices
May 19 00:16:32 psi15 kernel: raid1: hda4: rescheduling sector 4144
May 19 00:16:32 psi15 kernel: RAID1 conf printout:
May 19 00:16:32 psi15 kernel:  --- wd:1 rd:2
May 19 00:16:32 psi15 kernel:  disk 0, wo:1, o:0, dev:hda4
May 19 00:16:32 psi15 kernel:  disk 1, wo:0, o:1, dev:hdc3
May 19 00:16:32 psi15 kernel: RAID1 conf printout:
May 19 00:16:32 psi15 kernel:  --- wd:1 rd:2
May 19 00:16:32 psi15 kernel:  disk 1, wo:0, o:1, dev:hdc3
May 19 00:16:32 psi15 kernel: raid1: hdc3: redirecting sector 4144 to another mirror


hope this makes some sense to you - maybe it has something to do with power supply? (currently using 350 W - PC has 4 hard drives, PCI Graphics card, network card and that's it...)

if you need any further information feel free to ask for them :)

thx in advance

Philipp
Back to top
View user's profile Send private message
moocha
Watchman
Watchman


Joined: 21 Oct 2003
Posts: 5722

PostPosted: Thu May 19, 2005 6:22 am    Post subject: Reply with quote

Sounds like that drive is dying. I'd start looking for a replacement. Perhaps you could get a trial one to test run for a few days before actually buying it.
About the power starvage - that is a definite possibility. IM(NS)HO a 350W power supply is somewhat underpowered for four hard drives - depending on the drives, of course. For example, Seagate Barracuda 7 drives (which I love because they're so damn silent) are highly power-hungry, had to get a 450W one for a striped array. But if it were just the power supply then all drives would experience failures - or at least two drives, one on each power line. So it's probably one of them dying.
_________________
Military Commissions Act of 2006: http://tinyurl.com/jrcto

"Those who would give up essential liberty to purchase a little temporary safety deserve neither liberty nor safety."
-- attributed to Benjamin Franklin
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum