slugbait n00b
Joined: 24 Nov 2004 Posts: 22
|
Posted: Fri Feb 13, 2009 1:44 am Post subject: nvidia MCP78S + seagate 1.5TB drive = sata errors |
|
|
I'm building a new desktop and I've run into a problem. I've installed the system on a 320GB Western Digital drive with no problems:
beast ~ # uname -a
Linux beast 2.6.27-gentoo-r8 #1 SMP PREEMPT Thu Feb 12 12:23:10 EST 2009 x86_64 AMD Phenom(tm) II X4 940 Processor AuthenticAMD GNU/Linux
Now I'm trying to get the 1.5TB Seagate drive configured and mounted, but mkfs.ext3 produces the following in /var/log/messages:
Quote: |
Feb 12 15:13:07 beast ata3: hard resetting link
Feb 12 15:13:08 beast ata3: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Feb 12 15:13:08 beast ata3.00: configured for UDMA/133
Feb 12 15:13:08 beast ata3: EH complete
Feb 12 15:13:08 beast sd 2:0:0:0: [sdb] 2930277168 512-byte hardware sectors (1500302 MB)
Feb 12 15:13:08 beast sd 2:0:0:0: [sdb] Write Protect is off
Feb 12 15:13:08 beast sd 2:0:0:0: [sdb] Mode Sense: 00 3a 00 00
Feb 12 15:13:08 beast sd 2:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Feb 12 15:13:09 beast ata3: limiting SATA link speed to 1.5 Gbps
Feb 12 15:13:09 beast ata3.00: exception Emask 0x10 SAct 0x7fffffff SErr 0x400000 action 0x6 frozen
Feb 12 15:13:09 beast ata3.00: irq_stat 0x08000000, interface fatal error
Feb 12 15:13:09 beast ata3: SError: { Handshk }
Feb 12 15:13:09 beast ata3.00: cmd 61/40:00:0f:10:98/00:00:01:00:00/40 tag 0 ncq 32768 out
Feb 12 15:13:09 beast res 40/00:10:7f:09:14/00:00:02:00:00/40 Emask 0x10 (ATA bus error)
Feb 12 15:13:09 beast ata3.00: status: { DRDY }
Feb 12 15:13:09 beast ata3.00: cmd 61/50:08:1f:0c:e0/01:00:01:00:00/40 tag 1 ncq 172032 out
Feb 12 15:13:09 beast res 40/00:10:7f:09:14/00:00:02:00:00/40 Emask 0x10 (ATA bus error)
....
Feb 12 15:13:09 beast ata3: hard resetting link
|
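For anyone reading the log above: the `SErr 0x400000` value is a bitmask, and libata prints the names of the set bits on the `SError: { ... }` line. A minimal Python sketch of that decoding follows. It assumes the SATA SError register bit positions with the short names libata uses; the table is not part of this thread, so treat it as an assumption.

```python
# Bit positions of the SATA SError register, with the short names libata
# prints in its "SError: { ... }" line. Assumed layout, not from this thread.
SERROR_BITS = {
    0: "RecovData",
    1: "RecovComm",
    8: "UnrecovData",
    9: "Persist",
    10: "Proto",
    11: "HostInt",
    16: "PHYRdyChg",
    17: "PHYInt",
    18: "CommWake",
    19: "10B8B",
    20: "Dispar",
    21: "BadCRC",
    22: "Handshk",
    23: "LinkSeq",
    24: "TrStaTrns",
    25: "UnrecFIS",
    26: "DevExch",
}


def decode_serror(value):
    """Return the names of all known bits set in an SError value, lowest bit first."""
    return [name for bit, name in sorted(SERROR_BITS.items()) if value & (1 << bit)]


print(decode_serror(0x400000))  # the value logged for ata3 above
```

A handshake error is a link-level fault, so by itself it does not distinguish between cabling, the controller's PHY, and the drive's own firmware.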
I've updated the BIOS and enabled AHCI.... same errors. I have AHCI enabled in the kernel:
Quote: |
beast ~ # zgrep AHCI /proc/config.gz
CONFIG_SATA_AHCI=y
|
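The `zgrep` above only confirms that the AHCI driver is built in. A rough Python sketch of the same check, extended to list every enabled disk-controller option at once, might look like this. The `CONFIG_` prefixes it filters on are an assumption about which options matter, the sample text is made up for illustration, and `/proc/config.gz` only exists when the kernel was built with `CONFIG_IKCONFIG_PROC`.

```python
import gzip

# Option-name prefixes treated as "disk controller drivers" in this sketch.
# This selection is an assumption, not something stated in the thread.
PREFIXES = ("CONFIG_SATA_", "CONFIG_PATA_", "CONFIG_ATA", "CONFIG_IDE", "CONFIG_BLK_DEV_IDE")


def enabled_options(config_text, prefixes=PREFIXES):
    """Return {option: value} for options set to y or m whose name matches a prefix."""
    found = {}
    for line in config_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skips comments such as "# CONFIG_FOO is not set"
        name, sep, value = line.partition("=")
        if sep and value in ("y", "m") and name.startswith(prefixes):
            found[name] = value
    return found


def read_running_config(path="/proc/config.gz"):
    """Read the running kernel's config as text (needs CONFIG_IKCONFIG_PROC)."""
    with gzip.open(path, "rt") as handle:
        return handle.read()


# Made-up sample config fragment, for illustration only.
sample = "CONFIG_SATA_AHCI=y\n# CONFIG_SATA_NV is not set\nCONFIG_IDE=y\nCONFIG_EXT3_FS=y\n"
print(enabled_options(sample))
```

On the real machine, `enabled_options(read_running_config())` would replace the sample string.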
I've spent the last few hours trying to find a solution, but I've obviously failed. Can anyone help?
Quote: |
beast ~ # hdparm -I /dev/sdb
/dev/sdb:
ATA device, with non-removable media
Model Number: ST31500341AS
Serial Number: 9VS1118N
Firmware Revision: CC1H
Transport: Serial
Standards:
Used: unknown (minor revision code 0x0029)
Supported: 8 7 6 5
Likely used: 8
...
|
Quote: |
beast ~ # lspci
00:00.0 RAM memory: nVidia Corporation MCP78S [GeForce 8200] Memory Controller (rev a2)
00:01.0 ISA bridge: nVidia Corporation Device 075d (rev a2)
00:01.1 SMBus: nVidia Corporation MCP78S [GeForce 8200] SMBus (rev a1)
00:01.2 RAM memory: nVidia Corporation MCP78S [GeForce 8200] Memory Controller (rev a1)
00:01.3 Co-processor: nVidia Corporation MCP78S [GeForce 8200] Co-Processor (rev a2)
00:01.4 RAM memory: nVidia Corporation MCP78S [GeForce 8200] Memory Controller (rev a1)
00:02.0 USB Controller: nVidia Corporation MCP78S [GeForce 8200] OHCI USB 1.1 Controller (rev a1)
00:02.1 USB Controller: nVidia Corporation MCP78S [GeForce 8200] EHCI USB 2.0 Controller (rev a1)
00:04.0 USB Controller: nVidia Corporation MCP78S [GeForce 8200] OHCI USB 1.1 Controller (rev a1)
00:04.1 USB Controller: nVidia Corporation MCP78S [GeForce 8200] EHCI USB 2.0 Controller (rev a1)
00:06.0 IDE interface: nVidia Corporation MCP78S [GeForce 8200] IDE (rev a1)
00:07.0 Audio device: nVidia Corporation MCP78S [GeForce 8200] High Definition Audio (rev a1)
00:08.0 PCI bridge: nVidia Corporation MCP78S [GeForce 8200] PCI Bridge (rev a1)
00:09.0 SATA controller: nVidia Corporation MCP78S [GeForce 8200] AHCI Controller (rev a2)
00:0a.0 Ethernet controller: nVidia Corporation MCP78S [GeForce 8200] Ethernet (rev a2)
00:10.0 PCI bridge: nVidia Corporation MCP78S [GeForce 8200] PCI Express Bridge (rev a1)
00:12.0 PCI bridge: nVidia Corporation MCP78S [GeForce 8200] PCI Express Bridge (rev a1)
00:13.0 PCI bridge: nVidia Corporation MCP78S [GeForce 8200] PCI Bridge (rev a1)
00:14.0 PCI bridge: nVidia Corporation MCP78S [GeForce 8200] PCI Bridge (rev a1)
00:18.0 Host bridge: Advanced Micro Devices [AMD] Family 10h [Opteron, Athlon64, Sempron] HyperTransport Configuration
00:18.1 Host bridge: Advanced Micro Devices [AMD] Family 10h [Opteron, Athlon64, Sempron] Address Map
00:18.2 Host bridge: Advanced Micro Devices [AMD] Family 10h [Opteron, Athlon64, Sempron] DRAM Controller
00:18.3 Host bridge: Advanced Micro Devices [AMD] Family 10h [Opteron, Athlon64, Sempron] Miscellaneous Control
00:18.4 Host bridge: Advanced Micro Devices [AMD] Family 10h [Opteron, Athlon64, Sempron] Link Control
01:0a.0 FireWire (IEEE 1394): Agere Systems FW323 (rev 70)
02:00.0 VGA compatible controller: nVidia Corporation GeForce 8400 GS (rev a1)
|
Kernel config is at http://www.severus.org/kernel-config |
|
|
|
pappy_mcfae Watchman
Joined: 27 Dec 2007 Posts: 5999 Location: Pomona, California.
|
Posted: Fri Feb 13, 2009 6:41 am Post subject: |
|
|
Your .config is a mess, slugbait. Please post the results of lspci -n and cat /proc/cpuinfo as well as your /etc/fstab file, and I'll get you working right.
Blessed be!
Pappy _________________ This space left intentionally blank, except for these ASCII symbols. |
|
|
|
slugbait n00b
Joined: 24 Nov 2004 Posts: 22
|
Posted: Fri Feb 13, 2009 1:45 pm Post subject: |
|
|
The config is just the default from genkernel with most of the hardware I don't have disabled. I haven't gone the opposite way and built a completely stripped kernel yet, but I will certainly do so if necessary...
Quote: |
beast ~ # lspci -n
00:00.0 0500: 10de:0754 (rev a2)
00:01.0 0601: 10de:075d (rev a2)
00:01.1 0c05: 10de:0752 (rev a1)
00:01.2 0500: 10de:0751 (rev a1)
00:01.3 0b40: 10de:0753 (rev a2)
00:01.4 0500: 10de:0568 (rev a1)
00:02.0 0c03: 10de:077b (rev a1)
00:02.1 0c03: 10de:077c (rev a1)
00:04.0 0c03: 10de:077d (rev a1)
00:04.1 0c03: 10de:077e (rev a1)
00:06.0 0101: 10de:0759 (rev a1)
00:07.0 0403: 10de:0774 (rev a1)
00:08.0 0604: 10de:075a (rev a1)
00:09.0 0106: 10de:0ad4 (rev a2)
00:0a.0 0200: 10de:0760 (rev a2)
00:10.0 0604: 10de:0778 (rev a1)
00:12.0 0604: 10de:075b (rev a1)
00:13.0 0604: 10de:077a (rev a1)
00:14.0 0604: 10de:077a (rev a1)
00:18.0 0600: 1022:1200
00:18.1 0600: 1022:1201
00:18.2 0600: 1022:1202
00:18.3 0600: 1022:1203
00:18.4 0600: 1022:1204
01:0a.0 0c00: 11c1:5811 (rev 70)
02:00.0 0300: 10de:0422 (rev a1)
|
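The numeric listing is useful because the second column is the PCI class code, which shows how each controller presents itself regardless of its marketing name. Here is a small Python sketch that picks out the storage controllers, assuming the standard PCI class codes `0101` (IDE interface) and `0106` (SATA controller). The sample lines are copied from the output above.

```python
# PCI class codes for the storage controllers relevant here (standard PCI values).
STORAGE_CLASSES = {"0101": "IDE interface", "0106": "SATA controller"}


def storage_controllers(lspci_n_output):
    """Yield (slot, class_name, vendor:device) for IDE/SATA entries in `lspci -n` output."""
    for line in lspci_n_output.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue
        slot, pci_class, ids = fields[0], fields[1].rstrip(":"), fields[2]
        if pci_class in STORAGE_CLASSES:
            yield slot, STORAGE_CLASSES[pci_class], ids


sample = """\
00:06.0 0101: 10de:0759 (rev a1)
00:07.0 0403: 10de:0774 (rev a1)
00:09.0 0106: 10de:0ad4 (rev a2)
"""
for entry in storage_controllers(sample):
    print(entry)
```

Device `00:09.0` reports class `0106`, which agrees with the `AHCI Controller` label in the earlier `lspci` output.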
Quote: |
beast ~ # cat /proc/cpuinfo
processor : 0
vendor_id : AuthenticAMD
cpu family : 16
model : 4
model name : AMD Phenom(tm) II X4 940 Processor
stepping : 2
cpu MHz : 2999.995
cache size : 512 KB
physical id : 0
siblings : 4
core id : 0
cpu cores : 4
apicid : 0
initial apicid : 0
fpu : yes
fpu_exception : yes
cpuid level : 5
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nopl pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt
bogomips : 5999.98
TLB size : 1024 4K pages
clflush size : 64
cache_alignment : 64
address sizes : 48 bits physical, 48 bits virtual
power management: ts ttp tm stc 100mhzsteps hwpstate
processor : 1
vendor_id : AuthenticAMD
cpu family : 16
model : 4
model name : AMD Phenom(tm) II X4 940 Processor
stepping : 2
cpu MHz : 2999.995
cache size : 512 KB
physical id : 0
siblings : 4
core id : 1
cpu cores : 4
apicid : 1
initial apicid : 1
fpu : yes
fpu_exception : yes
cpuid level : 5
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nopl pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt
bogomips : 6000.00
TLB size : 1024 4K pages
clflush size : 64
cache_alignment : 64
address sizes : 48 bits physical, 48 bits virtual
power management: ts ttp tm stc 100mhzsteps hwpstate
processor : 2
vendor_id : AuthenticAMD
cpu family : 16
model : 4
model name : AMD Phenom(tm) II X4 940 Processor
stepping : 2
cpu MHz : 2999.995
cache size : 512 KB
physical id : 0
siblings : 4
core id : 2
cpu cores : 4
apicid : 2
initial apicid : 2
fpu : yes
fpu_exception : yes
cpuid level : 5
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nopl pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt
bogomips : 5999.99
TLB size : 1024 4K pages
clflush size : 64
cache_alignment : 64
address sizes : 48 bits physical, 48 bits virtual
power management: ts ttp tm stc 100mhzsteps hwpstate
processor : 3
vendor_id : AuthenticAMD
cpu family : 16
model : 4
model name : AMD Phenom(tm) II X4 940 Processor
stepping : 2
cpu MHz : 2999.995
cache size : 512 KB
physical id : 0
siblings : 4
core id : 3
cpu cores : 4
apicid : 3
initial apicid : 3
fpu : yes
fpu_exception : yes
cpuid level : 5
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nopl pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt
bogomips : 5999.99
TLB size : 1024 4K pages
clflush size : 64
cache_alignment : 64
address sizes : 48 bits physical, 48 bits virtual
power management: ts ttp tm stc 100mhzsteps hwpstate
|
I'm not sure why this matters, but:
Quote: |
/dev/sda1 /boot ext2 noauto,noatime 1 2
/dev/sda3 / ext3 noatime 0 1
/dev/sda2 none swap sw 0 0
/dev/cdrom /mnt/cdrom auto noauto,ro 0 0
shm /dev/shm tmpfs nodev,nosuid,noexec 0 0
|
/dev/sdb1 will eventually be mounted at /home when everything is working. |
|
|
|
pappy_mcfae Watchman
Joined: 27 Dec 2007 Posts: 5999 Location: Pomona, California.
|
Posted: Fri Feb 13, 2009 9:09 pm Post subject: |
|
|
Your /etc/fstab matters because the kernel must know which file systems you are using. Without that info, welcome to Kernel-panic-opolis.
As per your original setup, having the SATA drivers and the ATA/ATAPI/MFM/RLL drivers active is a recipe for conflicts and all manner of disaster. This may be a reason for your drive issues.
It is also possible that you have a bad drive. Yes, drives can be bad fresh out of the box. It doesn't take much shock to crush the read heads...spin up once, and you've just cut all kinds of grooves into the surface of the media. It's not pretty. If this kernel doesn't bring said drive around, take it back.
Click here for your new .config. Compile as is.
For the best results, please do the following:
1) Move your .config file out of your kernel source directory ( 2.6.27-gentoo-r8 ).
2) Issue the command make mrproper. This is a destructive step. It returns the source to pristine condition. Unmoved .config files will be deleted!
3) Copy my .config into your source directory.
4) Issue the command make && make modules_install.
5) Install the kernel as you normally would, and reboot.
6) Once it boots, please post /var/log/dmesg so I can see how things loaded.
As I said, this may or may not snap the drive into shape. What it will do, once you get that drive issue fixed, is give you a fast-running Linux kernel. That's always a good thing.
Blessed be!
Pappy _________________ This space left intentionally blank, except for these ASCII symbols. |
|
|
|
slugbait n00b
Joined: 24 Nov 2004 Posts: 22
|
|
|
|
pappy_mcfae Watchman
Joined: 27 Dec 2007 Posts: 5999 Location: Pomona, California.
|
Posted: Sat Feb 14, 2009 12:11 am Post subject: |
|
|
Good, then we have proved the 1.5TB drive to be bad, or its cables. Check to make sure the power and interface cables are all tight. If they are, take the drive back and get a new one, then retry.
Blessed be!
Pappy _________________ This space left intentionally blank, except for these ASCII symbols. |
|
|
|
naelq Tux's lil' helper
Joined: 18 May 2006 Posts: 146
|
Posted: Sat Feb 14, 2009 12:54 am Post subject: |
|
|
slugbait, could you please try the HDD with another distro, say an Ubuntu live CD, and report back? (Just to make sure that it isn't a missing option/configuration.)
nael _________________ main: Intel Xeon x3440 / Intel S3420GPLC / 6x 2GB DDR3 ECC REG / nVIDIA G210 / 3x 250GB AAKS || 2x 1TB FALS / Audigy 2 ZS / PCP&C 610w
laptop: Apple MacBook White T7200 / 2GB / 30GB Vertix |
|
|
|
pappy_mcfae Watchman
Joined: 27 Dec 2007 Posts: 5999 Location: Pomona, California.
|
Posted: Sat Feb 14, 2009 12:56 am Post subject: |
|
|
It is NOT missing anything with a Pappy seed.
Blessed be!
Pappy _________________ This space left intentionally blank, except for these ASCII symbols. |
|
|
|
naelq Tux's lil' helper
Joined: 18 May 2006 Posts: 146
|
Posted: Sat Feb 14, 2009 1:03 am Post subject: |
|
|
no offense mate, but i would NOT rush with
Quote: | Good, then we have proved the 1.5TB drive to be bad |
anyway, as you say!!
nael _________________ main: Intel Xeon x3440 / Intel S3420GPLC / 6x 2GB DDR3 ECC REG / nVIDIA G210 / 3x 250GB AAKS || 2x 1TB FALS / Audigy 2 ZS / PCP&C 610w
laptop: Apple MacBook White T7200 / 2GB / 30GB Vertix |
|
|
|
pappy_mcfae Watchman
Joined: 27 Dec 2007 Posts: 5999 Location: Pomona, California.
|
Posted: Sat Feb 14, 2009 1:10 am Post subject: |
|
|
After dealing with many people's dmesg output, I'm pretty sure I know what a bad drive looks like.
And no offense taken, but when I set out to help someone, I am thorough, if nothing else.
Blessed be!
Pappy _________________ This space left intentionally blank, except for these ASCII symbols. |
|
|
|
darklegion Guru
Joined: 14 Nov 2004 Posts: 468
|
Posted: Sat Feb 14, 2009 3:22 pm Post subject: |
|
|
Some of Seagate's recent drives have been having serious firmware issues. I know it occurred with larger drives, so it's possible that yours is affected. |
|
|
|
Monkeh Veteran
Joined: 06 Aug 2005 Posts: 1656 Location: England
|
Posted: Sat Feb 14, 2009 3:26 pm Post subject: |
|
|
pappy_mcfae wrote: | And no offense taken, but when I set out to help someone, I am thorough, if nothing else. |
Well you've missed the important stuff.
Install smartmontools and post the output of smartctl -ia /dev/sdb. |
|
|
|
pappy_mcfae Watchman
Joined: 27 Dec 2007 Posts: 5999 Location: Pomona, California.
|
Posted: Sat Feb 14, 2009 7:12 pm Post subject: |
|
|
Those tools would only tell me what /var/log/dmesg already did. The only other test I would do is to listen to the drive as it's running. If it ticks, voila, you have the proof. If it's quiet, as in no motor running, you have even better proof.
Blessed be!
Pappy _________________ This space left intentionally blank, except for these ASCII symbols. |
|
|
|
Monkeh Veteran
Joined: 06 Aug 2005 Posts: 1656 Location: England
|
Posted: Sat Feb 14, 2009 7:31 pm Post subject: |
|
|
pappy_mcfae wrote: | Those tools would only tell me what /var/log/dmesg already did. |
No, they'll tell you quite a lot more..
Quote: | The only other test I would do is to listen to the drive as it's running. If it ticks, voila, you have the proof. |
Of what? A head unload? A brief seek? There is only one useful diagnostic tool: SMART. |
|
|
|
rapsure Apprentice
Joined: 29 Apr 2004 Posts: 172 Location: Logan, UT USA
|
Posted: Sun Feb 15, 2009 6:27 am Post subject: |
|
|
Check your hard drive model and firmware and search Seagate's tech support. There was a period where the 1.5TB drives were known to do just what you are experiencing. _________________ Hindi ko naintindihan, pakiulit. Sometimes my code works. |
|
|
|
slugbait n00b
Joined: 24 Nov 2004 Posts: 22
|
Posted: Sun Feb 15, 2009 6:54 am Post subject: |
|
|
I booted from an ubuntu livecd yesterday and tried to run mkfs.ext3 and got the same slew of errors. It must be a bad drive, right? I bought another drive today and installed/formatted it with no problems:
beast ~ # df -h
Filesystem Size Used Avail Use% Mounted on
/dev/sda3 286G 5.4G 266G 2% /
udev 10M 88K 10M 1% /dev
shm 3.9G 0 3.9G 0% /dev/shm
/dev/sdb1 1.4T 99G 1.2T 8% /home
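As a side note on the `1.4T` figure: the kernel log earlier in the thread reports 2930277168 sectors of 512 bytes for this drive model. A quick Python check shows how that one number gives both the advertised 1.5 TB and the smaller binary-unit figure that `df -h` prints. This is only arithmetic on the logged sector count. The exact `df` value also depends on partitioning and filesystem overhead, which are not computed here.

```python
SECTORS = 2930277168      # sector count logged by the kernel for ST31500341AS
SECTOR_SIZE = 512         # bytes per sector, from the same log line

capacity_bytes = SECTORS * SECTOR_SIZE
decimal_tb = capacity_bytes / 10**12   # terabytes, as used on the drive label
binary_tib = capacity_bytes / 2**40    # tebibytes, the unit behind df's "T"

print(capacity_bytes)
print(round(decimal_tb, 2), "TB")
print(round(binary_tib, 2), "TiB")
```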
I started the transfer of my home directory on my current desktop (367GB worth of various file sizes) a few hours ago and went upstairs to watch some DVDs. I just came back down to find everything running smoothly on the surface, but /var/log/messages says otherwise:
Quote: |
Feb 14 23:05:58 beast [ 6969.924777] ata2.00: exception Emask 0x10 SAct 0x0 SErr 0x400000 action 0x6 frozen
Feb 14 23:05:58 beast [ 6969.924786] ata2.00: irq_stat 0x08000000, interface fatal error
Feb 14 23:05:58 beast [ 6969.924788] ata2: SError: { Handshk }
Feb 14 23:05:58 beast [ 6969.924792] ata2.00: cmd 35/00:00:bf:ef:a6/00:04:39:00:00/e0 tag 0 dma 524288 out
Feb 14 23:05:58 beast [ 6969.924793] res 50/00:00:be:ef:a6/00:00:39:00:00/e0 Emask 0x10 (ATA bus error)
Feb 14 23:05:58 beast [ 6969.924795] ata2.00: status: { DRDY }
Feb 14 23:05:58 beast [ 6969.924798] ata2: hard resetting link
Feb 14 23:05:58 beast [ 6970.229386] ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Feb 14 23:06:03 beast [ 6975.229013] ata2.00: qc timeout (cmd 0xec)
Feb 14 23:06:03 beast [ 6975.229025] ata2.00: failed to IDENTIFY (I/O error, err_mask=0x4)
Feb 14 23:06:03 beast [ 6975.229029] ata2.00: revalidation failed (errno=-5)
Feb 14 23:06:03 beast [ 6975.229032] ata2: hard resetting link
Feb 14 23:06:03 beast [ 6975.534008] ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Feb 14 23:06:03 beast [ 6975.537042] ata2.00: configured for UDMA/133
Feb 14 23:06:03 beast [ 6975.537053] ata2: EH complete
Feb 14 23:06:04 beast [ 6975.541041] sd 1:0:0:0: [sdb] 2930277168 512-byte hardware sectors (1500302 MB)
Feb 14 23:06:04 beast [ 6975.543602] sd 1:0:0:0: [sdb] Write Protect is off
Feb 14 23:06:04 beast [ 6975.543608] sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
Feb 14 23:06:04 beast [ 6975.549726] sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
|
Quote: |
Feb 14 23:29:34 beast [ 8386.201314] ata2.00: exception Emask 0x10 SAct 0x0 SErr 0x400000 action 0x6 frozen
Feb 14 23:29:34 beast [ 8386.201323] ata2.00: irq_stat 0x08000000, interface fatal error
Feb 14 23:29:34 beast [ 8386.201325] ata2: SError: { Handshk }
Feb 14 23:29:34 beast [ 8386.201329] ata2.00: cmd 35/00:00:77:a0:49/00:04:3c:00:00/e0 tag 0 dma 524288 out
Feb 14 23:29:34 beast [ 8386.201329] res 50/00:00:76:a0:49/00:00:3c:00:00/e0 Emask 0x10 (ATA bus error)
Feb 14 23:29:34 beast [ 8386.201349] ata2.00: status: { DRDY }
Feb 14 23:29:34 beast [ 8386.201358] ata2: hard resetting link
Feb 14 23:29:34 beast [ 8386.506011] ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Feb 14 23:29:39 beast [ 8391.506387] ata2.00: qc timeout (cmd 0xec)
Feb 14 23:29:39 beast [ 8391.506398] ata2.00: failed to IDENTIFY (I/O error, err_mask=0x4)
Feb 14 23:29:39 beast [ 8391.506402] ata2.00: revalidation failed (errno=-5)
Feb 14 23:29:39 beast [ 8391.506405] ata2: hard resetting link
Feb 14 23:29:40 beast [ 8391.811012] ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Feb 14 23:29:40 beast [ 8391.814034] ata2.00: configured for UDMA/133
Feb 14 23:29:40 beast [ 8391.814050] ata2: EH complete
Feb 14 23:29:40 beast [ 8391.819339] sd 1:0:0:0: [sdb] 2930277168 512-byte hardware sectors (1500302 MB)
Feb 14 23:29:40 beast [ 8391.821388] sd 1:0:0:0: [sdb] Write Protect is off
Feb 14 23:29:40 beast [ 8391.821397] sd 1:0:0:0: [sdb] Mode Sense: 00 3a 00 00
Feb 14 23:29:40 beast [ 8391.826825] sd 1:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
|
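Two details in these excerpts are easy to miss. The failed command is opcode `35`, whereas the first drive's failures were opcode `61`, and the bracketed kernel timestamps give the gap between the two events. The rough Python sketch below extracts both from libata `cmd` lines. It assumes the standard ATA opcode values in the table (only a handful are included) and the log line shape shown above.

```python
import re

# A few ATA command opcodes (standard values); the table is deliberately incomplete.
ATA_OPCODES = {
    0x25: "READ DMA EXT",
    0x35: "WRITE DMA EXT",
    0x60: "READ FPDMA QUEUED (NCQ)",
    0x61: "WRITE FPDMA QUEUED (NCQ)",
    0xEA: "FLUSH CACHE EXT",
    0xEC: "IDENTIFY DEVICE",
}

# Matches e.g. "[ 6969.924792] ata2.00: cmd 35/00:00:bf:..." (timestamp optional).
CMD_LINE = re.compile(r"(?:\[\s*(\d+\.\d+)\]\s+)?(ata\d+\.\d+): cmd ([0-9a-f]{2})/")


def failed_commands(log_text):
    """Return a list of (timestamp_or_None, port, opcode_name), one per cmd line."""
    results = []
    for match in CMD_LINE.finditer(log_text):
        stamp, port, opcode = match.groups()
        name = ATA_OPCODES.get(int(opcode, 16), "unknown opcode 0x" + opcode)
        results.append((float(stamp) if stamp else None, port, name))
    return results


sample = """\
Feb 12 15:13:09 beast ata3.00: cmd 61/40:00:0f:10:98/00:00:01:00:00/40 tag 0 ncq 32768 out
Feb 14 23:05:58 beast [ 6969.924792] ata2.00: cmd 35/00:00:bf:ef:a6/00:04:39:00:00/e0 tag 0 dma 524288 out
Feb 14 23:29:34 beast [ 8386.201329] ata2.00: cmd 35/00:00:77:a0:49/00:04:3c:00:00/e0 tag 0 dma 524288 out
"""
commands = failed_commands(sample)
for entry in commands:
    print(entry)

# Seconds of uptime between the two failures on the second drive.
gap = commands[2][0] - commands[1][0]
print(round(gap / 60, 1), "minutes apart")
```

The first drive failed on queued (NCQ) writes and the second on plain DMA writes, so the interface errors here do not appear to depend on NCQ being active.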
So... we're back where we started, just not as often. I started the dump at about 22:30 and it's still going as of 01:40 with only these two burps. Here is the output from smartctl after copying 108GB and the two errors reported above:
Quote: |
beast ~ # smartctl -ia /dev/sdb
smartctl version 5.38 [x86_64-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF INFORMATION SECTION ===
Device Model: ST31500341AS
Serial Number: 9VS04Q6Q
Firmware Version: SD19
User Capacity: 1,500,301,910,016 bytes
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: 8
ATA Standard is: ATA-8-ACS revision 4
Local Time is: Sun Feb 15 01:43:32 2009 EST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 600) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 255) minutes.
Conveyance self-test routine
recommended polling time: ( 2) minutes.
SCT capabilities: (0x103b) SCT Status supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 102 100 006 Pre-fail Always - 4626928
3 Spin_Up_Time 0x0003 097 097 000 Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 3
5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 100 253 030 Pre-fail Always - 148663
9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 4
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 3
184 Unknown_Attribute 0x0032 100 100 099 Old_age Always - 0
187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
188 Unknown_Attribute 0x0032 100 099 000 Old_age Always - 131074
189 High_Fly_Writes 0x003a 099 099 000 Old_age Always - 1
190 Airflow_Temperature_Cel 0x0022 068 066 045 Old_age Always - 32 (Lifetime Min/Max 19/34)
194 Temperature_Celsius 0x0022 032 040 000 Old_age Always - 32 (0 19 0 0)
195 Hardware_ECC_Recovered 0x001a 030 030 000 Old_age Always - 4626928
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 2
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
|
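In a long report like this, the attribute most directly tied to interface errors is `UDMA_CRC_Error_Count` (ID 199). The minimal Python sketch below pulls the raw values out of the attribute table. It assumes each attribute sits on one whitespace-separated row in the usual `smartctl -A` column order, so output that was wrapped by a narrow terminal would need rejoining first. The sample rows reuse values from the report above.

```python
def parse_smart_attributes(table_text):
    """Map attribute name -> dict of id, normalized value, worst, threshold, raw string."""
    attributes = {}
    for line in table_text.splitlines():
        fields = line.split()
        # Attribute rows start with a numeric ID and have at least 10 columns.
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        attributes[fields[1]] = {
            "id": int(fields[0]),
            "value": int(fields[3]),
            "worst": int(fields[4]),
            "thresh": int(fields[5]),
            "raw": " ".join(fields[9:]),
        }
    return attributes


sample = """\
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 2
"""
attrs = parse_smart_attributes(sample)
for name in ("Reallocated_Sector_Ct", "Current_Pending_Sector", "UDMA_CRC_Error_Count"):
    print(name, attrs[name]["raw"])
```

A raw CRC count of 2 alongside zero reallocated and pending sectors lines up with the two interface errors logged during the copy. That hints at the link rather than the platters, though two events are too few to be conclusive.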
I did my homework before buying this motherboard to make sure the chipset was supported, but I didn't dig as deeply into the recent history of Seagate's big drives. I'd read about the dead drive issues, but all of the articles I read indicated that the problem was resolved in the firmware updates. I still have the original drive, so I plan to install it in my current desktop tomorrow to try to eliminate one more variable. I have a WindowsXP partition on this system, so I'll even be able to try that... |
|
|
|
slugbait n00b
Joined: 24 Nov 2004 Posts: 22
|
Posted: Sun Feb 15, 2009 7:06 am Post subject: |
|
|
Heh... I just noticed something:
Quote: |
Device Model: ST31500341AS
Serial Number: 9VS04Q6Q
Firmware Version: SD19
|
from the dmesg:
Quote: |
[ 1.733009] ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[ 1.734426] ata2.00: ATA-8: ST31500341AS, SD19, max UDMA/133
[ 1.734503] ata2.00: 2930277168 sectors, multi 0: LBA48 NCQ (not used)
[ 1.734584] ata2.00: WARNING: device requires firmware update to be fully functional.
[ 1.734699] ata2.00: contact the vendor or visit http://ata.wiki.kernel.org.
[ 1.736449] ata2.00: configured for UDMA/133
|
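The identify line and the warning can be matched up mechanically. Below is a rough Python sketch that extracts the model and firmware from a libata identify line and checks them against a lookup table. `KNOWN_BAD` is a hypothetical table for illustration, holding only the one model/firmware pair this thread confirms. It is not a copy of the kernel's actual list.

```python
import re

# Hypothetical table for illustration: only the pair confirmed in this thread.
KNOWN_BAD = {("ST31500341AS", "SD19")}

# Matches e.g. "ata2.00: ATA-8: ST31500341AS, SD19, max UDMA/133"
IDENTIFY_LINE = re.compile(r"(ata\d+\.\d+): ATA-\d+: (.+?), (\S+), max ")


def flagged_drives(dmesg_text, known_bad=KNOWN_BAD):
    """Return (port, model, firmware) for each identified drive listed in known_bad."""
    hits = []
    for port, model, firmware in IDENTIFY_LINE.findall(dmesg_text):
        if (model, firmware) in known_bad:
            hits.append((port, model, firmware))
    return hits


sample = """\
[ 1.734426] ata2.00: ATA-8: ST31500341AS, SD19, max UDMA/133
[ 1.734503] ata2.00: 2930277168 sectors, multi 0: LBA48 NCQ (not used)
"""
print(flagged_drives(sample))
```

The first drive in this thread reported firmware `CC1H` in its `hdparm -I` output, so this particular table would not flag it.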
http://ata.wiki.kernel.org/index.php/Known_issues#Seagate_harddrives_which_time_out_FLUSH_CACHE_when_NCQ_is_being_used
"problem happens most frequently on this model" doesn't sound so good... |
|
|
|
naelq Tux's lil' helper
Joined: 18 May 2006 Posts: 146
|
Posted: Sun Feb 15, 2009 12:27 pm Post subject: |
|
|
At least the drive is NOT faulty. Good luck!
nael _________________ main: Intel Xeon x3440 / Intel S3420GPLC / 6x 2GB DDR3 ECC REG / nVIDIA G210 / 3x 250GB AAKS || 2x 1TB FALS / Audigy 2 ZS / PCP&C 610w
laptop: Apple MacBook White T7200 / 2GB / 30GB Vertix |
|
|
|
Monkeh Veteran
Joined: 06 Aug 2005 Posts: 1656 Location: England
|
Posted: Sun Feb 15, 2009 3:06 pm Post subject: |
|
|
Well the drive isn't physically faulty. It could be bad firmware, bad PHY, or just a bad cable or connection.
Try using a different SATA cable on it, and make sure it's secure. If it still fails, get it replaced. |
|
|
|
slugbait n00b
Joined: 24 Nov 2004 Posts: 22
|
Posted: Sun Feb 15, 2009 7:45 pm Post subject: |
|
|
This is becoming absurd. I just tried to use Seagate's iso (Brinks-4D8H-SD1B.ISO) to update the firmware on the second drive from SD19 to SD1B. It boots and loads the detection program, but it doesn't see the drive. Apparently Seagate has decided that "only a small batch of drives" were b0rked, so the firmware update program will only detect and upgrade drives with certain serial numbers. Meanwhile, libata disables NCQ as soon as it sees the SD19 firmware.
Seagate is now officially on my vendor shit-list. I've wasted far too much time trying to fix this problem. |
|
|
|
slugbait n00b
Joined: 24 Nov 2004 Posts: 22
|
Posted: Sun Feb 15, 2009 7:47 pm Post subject: |
|
|
"Check the cable" is one of the first things I did, by the way... I've used 4 different cables on 3 different SATA ports on the motherboard. |
|
|
|
Drone1 Apprentice
Joined: 27 Sep 2005 Posts: 232 Location: United States of Texas
|
Posted: Sun Feb 15, 2009 8:31 pm Post subject: |
|
|
slugbait
We have 4 ST31500341NS drives at work (NS, I think; they were listed as NOT having the issue, but we were seeing horrible performance issues and strange RAID behavior), and only 3 would upgrade the firmware to the SD1B version on the intended system. We tried updating the 4th drive on a completely different system, and it updated WITHOUT issue. The intended system is now running as we had planned....
If you have another system with SATA, try to update the HD firmware using that system. There is more going on with the Seagate drives than what is on the forums and on the tech sites. It updates on one system but not another? And yes, I'm dealing with different architectures, processors, and chipset brands between those 2 systems to get the HDs' firmware updated. _________________ The GUI has become stale to me.... Where can I find the next interface leap forward? |
|
|
|
Monkeh Veteran
Joined: 06 Aug 2005 Posts: 1656 Location: England
|
Posted: Sun Feb 15, 2009 9:31 pm Post subject: |
|
|
Try using legacy IDE mode for the firmware update instead of AHCI mode, if it supports it. |
|
|
|
|