Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
SATA drives seem to always fail.
View unanswered posts
View posts from last 24 hours

 
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware
View previous topic :: View next topic  
Author Message
arantius
n00b
n00b


Joined: 03 Jan 2004
Posts: 69

PostPosted: Sun Aug 26, 2007 4:39 pm    Post subject: SATA drives seem to always fail. Reply with quote

From /var/log/kern.log:

Code:
Aug 25 09:34:09 jersey ------------[ cut here ]------------
Aug 25 09:34:09 jersey kernel BUG at lib/radix-tree.c:317!
Aug 25 09:34:09 jersey invalid opcode: 0000 [#1]
Aug 25 09:34:09 jersey Modules linked in: ext3 jbd usb_storage tulip bitrev crc32 ehci_hcd ohci_hcd usbcore sis5513
Aug 25 09:34:09 jersey CPU:    0
Aug 25 09:34:09 jersey EIP:    0060:[<c0225efa>]    Not tainted VLI
Aug 25 09:34:09 jersey EFLAGS: 00010086   (2.6.20-hardened-r6-jersey #8)
Aug 25 09:34:09 jersey EIP is at radix_tree_insert+0x12a/0x140
Aug 25 09:34:09 jersey eax: ffffffff   ebx: 00000000   ecx: 00000026   edx: 00000000
Aug 25 09:34:09 jersey esi: fffffffa   edi: 00000025   ebp: c2a94450   esp: c9adbd4c
Aug 25 09:34:09 jersey ds: 0068   es: 007b   gs: 00d8   ss: 0068
Aug 25 09:34:09 jersey Process rsync (pid: 4142, ti=c9ada000 task=c6ba8580 task.ti=c9ada000)
Aug 25 09:34:09 jersey Stack: c1065b00 0001d025 c5bbea44 c1065b00 c5bbea40 0001d025 0001d025 c0153d5e
Aug 25 09:34:09 jersey c1065b00 c1065b00 000200d2 c0153dc3 000200d2 c1065b00 00000000 c0154192
Aug 25 09:34:09 jersey 000200d2 c5bbea40 c10630a0 c5bbea40 0001d020 00000005 c01b58b9 00000000
Aug 25 09:34:09 jersey Call Trace:
Aug 25 09:34:09 jersey [<c0153d5e>] add_to_page_cache+0x2e/0x80
Aug 25 09:34:09 jersey [<c0153dc3>] add_to_page_cache_lru+0x13/0x30
Aug 25 09:34:09 jersey [<c0154192>] find_or_create_page+0x22/0x90
Aug 25 09:34:09 jersey [<c01b58b9>] reiserfs_prepare_file_region_for_write+0xc9/0x8b0
Aug 25 09:34:09 jersey [<c01cc0d5>] do_journal_end+0x895/0x9f0
Aug 25 09:34:09 jersey [<c01b5530>] reiserfs_submit_file_region_for_write+0x100/0x1f0
Aug 25 09:34:09 jersey [<c01b6357>] reiserfs_file_write+0x2b7/0x510
Aug 25 09:34:09 jersey [<c0170187>] vfs_write+0x87/0x160
Aug 25 09:34:09 jersey [<c0170311>] sys_write+0x41/0x70
Aug 25 09:34:09 jersey [<c0124c60>] syscall_call+0x7/0xb
Aug 25 09:34:09 jersey =======================
Aug 25 09:34:09 jersey Code: 75 33 a9 00 00 20 00 74 96 0f 0b eb fe 8d 74 26 00 8b 44 24 08 89 50 08 e9 32 ff ff ff 0f 0b eb fe b8 f4 ff ff ff e9 76 ff ff ff <0f> 0b eb fe 89 f6 0f 0b eb fe 0f 0b eb fe 90 8d b4 26 00 00 00
Aug 25 09:34:09 jersey EIP: [<c0225efa>] radix_tree_insert+0x12a/0x140 SS:ESP 0068:c9adbd4c


Code:
Aug 26 11:48:46 jersey ------------[ cut here ]------------
Aug 26 11:48:46 jersey kernel BUG at lib/radix-tree.c:317!
Aug 26 11:48:46 jersey invalid opcode: 0000 [#1]
Aug 26 11:48:46 jersey Modules linked in: nfsd exportfs lockd sunrpc tulip bitrev crc32 ehci_hcd sis5513 ohci_hcd usbcore
Aug 26 11:48:46 jersey CPU:    0
Aug 26 11:48:46 jersey EIP:    0060:[<c022649a>]    Not tainted VLI
Aug 26 11:48:46 jersey EFLAGS: 00010086   (2.6.20-hardened-r6-jersey #1)
Aug 26 11:48:46 jersey EIP is at radix_tree_insert+0x12a/0x140
Aug 26 11:48:46 jersey eax: ffffffff   ebx: 00000000   ecx: 00000026   edx: 00000000
Aug 26 11:48:46 jersey esi: fffffffa   edi: 00000025   ebp: c5354630   esp: c93f7c04
Aug 26 11:48:46 jersey ds: 007b   es: 007b   gs: 00d8   ss: 0068
Aug 26 11:48:46 jersey Process cat (pid: 30403, ti=c93f6000 task=ca397a90 task.ti=c93f6000)
Aug 26 11:48:46 jersey Stack: c1193e60 00003ba5 c8c85ac4 c1193e60 c8c85ac0 00003ba5 c93f7d5c c015414e
Aug 26 11:48:46 jersey c1193e60 00000009 00000020 c0194898 000000d0 c93f7c58 c93f7c54 c01afb00
Aug 26 11:48:46 jersey cb5cb800 c8c85ac0 02e2b289 00000000 00003ba4 00000000 00000000 c10cd600
Aug 26 11:48:46 jersey Call Trace:
Aug 26 11:48:46 jersey [<c015414e>] add_to_page_cache+0x2e/0x80
Aug 26 11:48:46 jersey [<c0194898>] mpage_readpages+0x88/0x130
Aug 26 11:48:46 jersey [<c01afb00>] reiserfs_get_block+0x0/0x11c0
Aug 26 11:48:46 jersey [<c015a659>] read_pages+0xd9/0xf0
Aug 26 11:48:46 jersey [<c01afb00>] reiserfs_get_block+0x0/0x11c0
Aug 26 11:48:46 jersey [<c015887e>] __alloc_pages+0x5e/0x300
Aug 26 11:48:46 jersey [<c01b169e>] reiserfs_update_sd_size+0x11e/0x1b0
Aug 26 11:48:46 jersey [<c015a789>] __do_page_cache_readahead+0x119/0x130
Aug 26 11:48:46 jersey [<c015a89f>] blockable_page_cache_readahead+0x3f/0xb0
Aug 26 11:48:46 jersey [<c015a978>] make_ahead_window+0x68/0xb0
Aug 26 11:48:46 jersey [<c015aa48>] page_cache_readahead+0x88/0x190
Aug 26 11:48:46 jersey [<c0154c1f>] do_generic_mapping_read+0x49f/0x6a0
Aug 26 11:48:46 jersey [<c0155087>] generic_file_aio_read+0x167/0x1c0
Aug 26 11:48:46 jersey [<c0154e20>] file_read_actor+0x0/0x100
Aug 26 11:48:46 jersey [<c0170228>] do_sync_read+0xc8/0x120
Aug 26 11:48:46 jersey [<c01475d0>] autoremove_wake_function+0x0/0x40
Aug 26 11:48:46 jersey [<c0170305>] vfs_read+0x85/0x150
Aug 26 11:48:46 jersey [<c0170691>] sys_read+0x41/0x70
Aug 26 11:48:46 jersey [<c0124c60>] syscall_call+0x7/0xb
Aug 26 11:48:46 jersey =======================
Aug 26 11:48:46 jersey Code: 75 33 a9 00 00 20 00 74 96 0f 0b eb fe 8d 74 26 00 8b 44 24 08 89 50 08 e9 32 ff ff ff 0f 0b eb fe b8 f4 ff ff ff e9 76 ff ff ff <0f> 0b eb fe 89 f6 0f 0b eb fe 0f 0b eb fe 90 8d b4 26 00 00 00
Aug 26 11:48:46 jersey EIP: [<c022649a>] radix_tree_insert+0x12a/0x140 SS:ESP 0068:c93f7c04


Code:
Aug 26 12:12:46 jersey ------------[ cut here ]------------
Aug 26 12:12:46 jersey kernel BUG at lib/radix-tree.c:317!
Aug 26 12:12:46 jersey invalid opcode: 0000 [#1]
Aug 26 12:12:46 jersey Modules linked in: tulip bitrev crc32 ehci_hcd sis5513 ohci_hcd usbcore
Aug 26 12:12:46 jersey CPU:    0
Aug 26 12:12:46 jersey EIP:    0060:[<c022649a>]    Not tainted VLI
Aug 26 12:12:46 jersey EFLAGS: 00010086   (2.6.20-hardened-r6-jersey #1)
Aug 26 12:12:46 jersey EIP is at radix_tree_insert+0x12a/0x140
Aug 26 12:12:46 jersey eax: ffffffff   ebx: 00000000   ecx: 00000026   edx: 00000000
Aug 26 12:12:46 jersey esi: fffffffa   edi: 00000025   ebp: c3007a30   esp: c6525c04
Aug 26 12:12:46 jersey ds: 007b   es: 007b   gs: 00d8   ss: 0068
Aug 26 12:12:46 jersey Process cat (pid: 4076, ti=c6524000 task=cc04f560 task.ti=c6524000)
Aug 26 12:12:46 jersey Stack: c1073ba0 00003aa5 cb2598cc c1073ba0 cb2598c8 00003aa5 c6525d5c c015414e
Aug 26 12:12:46 jersey c1073ba0 00000009 00000020 c0194898 000000d0 c6525c58 c6525c54 c01afb00
Aug 26 12:12:46 jersey cb3fa9c0 cb2598c8 01276989 00000000 00003aa4 00000000 00000000 c1160260
Aug 26 12:12:46 jersey Call Trace:
Aug 26 12:12:46 jersey [<c015414e>] add_to_page_cache+0x2e/0x80
Aug 26 12:12:46 jersey [<c0194898>] mpage_readpages+0x88/0x130
Aug 26 12:12:46 jersey [<c01afb00>] reiserfs_get_block+0x0/0x11c0
Aug 26 12:12:46 jersey [<c015a659>] read_pages+0xd9/0xf0
Aug 26 12:12:46 jersey [<c01afb00>] reiserfs_get_block+0x0/0x11c0
Aug 26 12:12:46 jersey [<c015887e>] __alloc_pages+0x5e/0x300
Aug 26 12:12:46 jersey [<c01b169e>] reiserfs_update_sd_size+0x11e/0x1b0
Aug 26 12:12:46 jersey [<c015a789>] __do_page_cache_readahead+0x119/0x130
Aug 26 12:12:46 jersey [<c015a89f>] blockable_page_cache_readahead+0x3f/0xb0
Aug 26 12:12:46 jersey [<c015a978>] make_ahead_window+0x68/0xb0
Aug 26 12:12:46 jersey [<c015aa48>] page_cache_readahead+0x88/0x190
Aug 26 12:12:46 jersey [<c0154c1f>] do_generic_mapping_read+0x49f/0x6a0
Aug 26 12:12:46 jersey [<c0155087>] generic_file_aio_read+0x167/0x1c0
Aug 26 12:12:46 jersey [<c0154e20>] file_read_actor+0x0/0x100
Aug 26 12:12:46 jersey [<c0170228>] do_sync_read+0xc8/0x120
Aug 26 12:12:46 jersey [<c01475d0>] autoremove_wake_function+0x0/0x40
Aug 26 12:12:46 jersey [<c0126909>] do_IRQ+0x49/0x80
Aug 26 12:12:46 jersey [<c0126909>] do_IRQ+0x49/0x80
Aug 26 12:12:46 jersey [<c0124ed3>] common_interrupt+0x23/0x30
Aug 26 12:12:46 jersey [<c0170305>] vfs_read+0x85/0x150
Aug 26 12:12:46 jersey [<c0170691>] sys_read+0x41/0x70
Aug 26 12:12:46 jersey [<c0124c60>] syscall_call+0x7/0xb
Aug 26 12:12:46 jersey =======================
Aug 26 12:12:46 jersey Code: 75 33 a9 00 00 20 00 74 96 0f 0b eb fe 8d 74 26 00 8b 44 24 08 89 50 08 e9 32 ff ff ff 0f 0b eb fe b8 f4 ff ff ff e9 76 ff ff ff <0f> 0b eb fe 89 f6 0f 0b eb fe 0f 0b eb fe 90 8d b4 26 00 00 00
Aug 26 12:12:46 jersey EIP: [<c022649a>] radix_tree_insert+0x12a/0x140 SS:ESP 0068:c6525c04


This last one certainly happened when I was re-assembling the 22nd of 100MB chunks of a 4.7 gig ISO image back together into a single file. Trying to transfer this file over the network was failing, so I used split, transferred the pieces, and was using cat to reassemble. So, this was after 2.2ish gigs had been read and written to the disk in rapid succession.

This is only the last event in a long chain of SATA woes that I've had. It started in my home server machine. The motherboard happened to have a SATA controller embedded in it, and I had heard that SATA was the new big thing, so when I needed a new bigger drive, I tried out SATA. Long story short, I had around 5 drive failures over 18 months, and gave up. I bought a regular IDE drive and haven't had a problem since. I decided something about the SATA controller, its linux driver, or the combination was the real issue.

The last "failed" (the drives were probably never bad) drive had been lying around for quite some time. I never had a chance to get it sold or repurposed or anything. I've recently used it as the cheapest route to having some space to do remote backups on. I've got it in another machine and my existing server in the same room, and I'm trying to get the contents of the first backup loaded, so it can be moved and then just synced incrementally over the internet.

I've had constant troubles and errors. Plain SCP of files or rsync over SSH constantly reported MAC errors when transmitting large files. I set up NFS and had a bit more luck, but still couldn't ship this 4.7 gig file across. The 100mb pieces went, individually, but as mentioned, I can't re-assemble them. The SATA drive doesn't seem to be able to handle it. This is a Seagate drive, which I believe to be relatively reputable, but this is the third in a line of new, RMA, RMA that has exhibited problems like this.


So, after that short novel, does anyone have any advice for me? I'm starting to think that relying on this SATA drive for backup is a losing game. I've had nothing but bad luck with SATA drives under linux.
Back to top
View user's profile Send private message
BradN
Advocate
Advocate


Joined: 19 Apr 2002
Posts: 2391
Location: Wisconsin (USA)

PostPosted: Sun Aug 26, 2007 5:30 pm    Post subject: Reply with quote

SATA drives work fine on my system (using a VIA VT6420 controller). This seems to suggest you have more problems than just the SATA issues though: "I've had constant troubles and errors. Plain SCP of files or rsync over SSH constantly reported MAC errors when transmitting large files."

You could try just exercising the SATA subsystem without using the network and see if the system stability is improved, and it might be a good idea to try a different SATA controller as well.
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Kernel & Hardware All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum