View previous topic :: View next topic |
Author |
Message |
kluvo n00b
![n00b n00b](/images/ranks/rank_rect_0.gif)
![](images/avatars/214239289446aa715b3d1a3.gif)
Joined: 25 Aug 2005 Posts: 28 Location: Bratislava, Slovakia
|
Posted: Wed Mar 19, 2008 10:16 am Post subject: smart - pending sector |
|
|
Hi folks,
I have here one weired pending sector on one of my disks, long and short tests finished successfully several times but the pending sector alert is still reoccuring in log and I'm receiving it in mail every 24h.
Code: | Mar 19 04:12:55 <hostname> smartd[18430]: Device: /dev/hda, 1 Currently unreadable (pending) sectors
Mar 19 04:42:55 <hostname> smartd[18430]: Device: /dev/hda, 1 Currently unreadable (pending) sectors
Mar 19 05:12:56 <hostname> smartd[18430]: Device: /dev/hda, 1 Currently unreadable (pending) sectors
Mar 19 05:42:56 <hostname> smartd[18430]: Device: /dev/hda, 1 Currently unreadable (pending) sectors
Mar 19 06:12:56 <hostname> smartd[18430]: Device: /dev/hda, 1 Currently unreadable (pending) sectors
Mar 19 06:42:55 <hostname> smartd[18430]: Device: /dev/hda, 1 Currently unreadable (pending) sectors
Mar 19 07:12:55 <hostname> smartd[18430]: Device: /dev/hda, 1 Currently unreadable (pending) sectors
Mar 19 07:42:55 <hostname> smartd[18430]: Device: /dev/hda, 1 Currently unreadable (pending) sectors
Mar 19 08:12:55 <hostname> smartd[18430]: Device: /dev/hda, 1 Currently unreadable (pending) sectors
Mar 19 08:42:55 <hostname> smartd[18430]: Device: /dev/hda, 1 Currently unreadable (pending) sectors
Mar 19 09:12:55 <hostname> smartd[18430]: Device: /dev/hda, 1 Currently unreadable (pending) sectors
Mar 19 09:42:55 <hostname> smartd[18430]: Device: /dev/hda, 1 Currently unreadable (pending) sectors
Mar 19 10:12:55 <hostname> smartd[18430]: Device: /dev/hda, 1 Currently unreadable (pending) sectors
|
Funny thing on it all it that there is no other note in log about the sector number, nor in the selftest results, here is the ouptut of smartctl -a /dev/hda
Code: | smartctl version 5.37 [i686-pc-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF INFORMATION SECTION ===
Device Model: Hitachi HDS721616PLAT80
Serial Number: PV1300Z2SVJAEA
Firmware Version: P22OA85A
User Capacity: 164,696,555,520 bytes
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: 7
ATA Standard is: ATA/ATAPI-7 T13 1532D revision 1
Local Time is: Wed Mar 19 10:49:24 2008 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x84) Offline data collection activity
was suspended by an interrupting command from host.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: (2865) seconds.
Offline data collection
capabilities: (0x5b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: ( 48) minutes.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000b 095 095 016 Pre-fail Always - 327685
2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 0
3 Spin_Up_Time 0x0007 253 253 024 Pre-fail Always - 65 (Average 64)
4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 56
5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 1
7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0
8 Seek_Time_Performance 0x0005 132 132 020 Pre-fail Offline - 33
9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 4831
10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 56
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 258
193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 258
194 Temperature_Celsius 0x0002 240 240 000 Old_age Always - 25 (Lifetime Min/Max 14/54)
196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 1
197 Current_Pending_Sectorx0022 0 100 100 000 Old_age Always - 1
198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Short offline Completed without error 00% 4822 -
# 2 Short offline Completed without error 00% 4806 -
# 3 Extended offline Completed without error 00% 4805 -
# 4 Short offline Completed without error 00% 4798 -
# 5 Short offline Completed without error 00% 4773 -
# 6 Extended offline Completed without error 00% 4752 -
# 7 Short offline Completed without error 00% 4749 -
# 8 Short offline Completed without error 00% 4725 -
# 9 Extended offline Completed without error 00% 4709 -
#10 Short offline Completed without error 00% 4701 -
#11 Short offline Completed without error 00% 4677 -
#12 Short offline Completed without error 00% 4653 -
#13 Short offline Completed without error 00% 4629 -
#14 Short offline Completed without error 00% 4605 -
#15 Extended offline Completed without error 00% 4584 -
#16 Short offline Completed without error 00% 4581 -
#17 Short offline Completed without error 00% 4557 -
#18 Short offline Completed without error 00% 4533 -
#19 Short offline Completed without error 00% 4509 -
#20 Short offline Completed without error 00% 4485 -
#21 Short offline Completed without error 00% 4461 -
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
|
As you can see all test were successful and no errors were logged at all.
Code: | 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 1
196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 1
|
In past there was one rellocated sector, sector number was listed in long test result and it was corrected.
Now there is only Current_Pending_Sector and no sector number mentioned at all.
Code: | 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 1
|
Was someone dealing with similar issue ? |
|
Back to top |
|
![](templates/gentoo/images/spacer.gif) |
kluvo n00b
![n00b n00b](/images/ranks/rank_rect_0.gif)
![](images/avatars/214239289446aa715b3d1a3.gif)
Joined: 25 Aug 2005 Posts: 28 Location: Bratislava, Slovakia
|
Posted: Tue Apr 22, 2008 8:38 am Post subject: new info |
|
|
As I can see no one knows what does it mean ok anyway here is some new progress on the disk status.
After a long time there have been found some new pending sectors so here is the smartctl output:
Code: | # smartctl -l selftest /dev/hda
smartctl version 5.37 [i686-pc-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed: read failure 10% 5645 321669951
# 2 Extended offline Completed: read failure 10% 5645 321669951
# 3 Short offline Completed without error 00% 5638 -
# 4 Extended offline Completed: read failure 90% 5624 19627208
# 5 Short offline Completed: read failure 10% 5613 321669954
# 6 Extended offline Completed without error 00% 5592 -
# 7 Short offline Completed without error 00% 5589 -
# 8 Short offline Completed without error 00% 5565 -
# 9 Short offline Completed without error 00% 5541 -
#10 Short offline Completed without error 00% 5517 -
#11 Short offline Completed without error 00% 5493 -
#12 Short offline Completed without error 00% 5469 -
#13 Short offline Completed without error 00% 5445 -
#14 Extended offline Completed without error 00% 5424 -
#15 Short offline Completed without error 00% 5421 -
#16 Short offline Completed without error 00% 5397 -
#17 Short offline Completed without error 00% 5373 -
#18 Short offline Completed without error 00% 5349 -
#19 Short offline Completed without error 00% 5325 -
#20 Short offline Completed without error 00% 5301 -
#21 Short offline Completed without error 00% 5277 -
# smartctl -A /dev/hda
smartctl version 5.37 [i686-pc-linux-gnu] Copyright (C) 2002-6 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0
2 Throughput_Performance 0x0005 100 100 050 Pre-fail Offline - 42
3 Spin_Up_Time 0x0007 253 253 024 Pre-fail Always - 65 (Average 64)
4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 56
5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 4
7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0
8 Seek_Time_Performance 0x0005 100 100 020 Pre-fail Offline - 54
9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 5647
10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 56
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 292
193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 292
194 Temperature_Celsius 0x0002 200 200 000 Old_age Always - 30 (Lifetime Min/Max 14/54)
196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 4
197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 4
198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0
|
so now I see some sector which seems to be bad, but as we can see on output from fdisk:
Code: | fdisk -lu /dev/hda
Disk /dev/hda: 164.6 GB, 164696555520 bytes
255 heads, 63 sectors/track, 20023 cylinders, total 321672960 sectors
Units = sectors of 1 * 512 = 512 bytes
Disk identifier: 0x00000000
Device Boot Start End Blocks Id System
/dev/hda1 63 208844 104391 fd Linux raid autodetect
/dev/hda2 208845 321669494 160730325 fd Linux raid autodetect
|
the /dev/hda2 ends on sector 321669494 but the bad sector is 321669951 which seems to be in spare area??
I have there LVM volumes on software raid1 and it doesn't seems that the block is used at all, because not all diskspace is used yet.
I would appreciate any suggestions as it definitely looks very weird
Thanks for replies. |
|
Back to top |
|
![](templates/gentoo/images/spacer.gif) |
|
|
You cannot post new topics in this forum You cannot reply to topics in this forum You cannot edit your posts in this forum You cannot delete your posts in this forum You cannot vote in polls in this forum
|
|