Gentoo Forums
eth0 timeout when system not being used for a while
laurelraven
n00b


Joined: 12 Sep 2007
Posts: 17

Posted: Mon Jan 12, 2009 4:43 am    Post subject: eth0 timeout when system not being used for a while

I'm having an issue that I can't seem to find any help with online so far (maybe my google-fu needs some work).

I have a system that I built Gentoo on about a year ago, and it had been working fine for the most part until the last three months or so. Now, when the system is left sitting for a few hours or overnight, it sometimes has no internet connectivity when I get back on it. Rebooting or restarting /etc/init.d/net.eth0 fixes the problem in the short term, but it happens again. Since it is my girlfriend's computer, and she doesn't want to mess with learning shell commands, I keep getting called in to "fix it".

So, I'm looking for a more permanent solution.

I pulled the logs from the system for today, since it happened again, and tracked down what looks like the error message when it first stops working:

Jan 11 09:45:43 [kernel] irq 23: nobody cared (try booting with the "irqpoll" option)
Jan 11 09:45:43 [kernel] Pid: 0, comm: swapper Tainted: P 2.6.25-gentoo-r7 #1
Jan 11 09:45:43 [kernel] Call Trace:
Jan 11 09:45:43 [kernel] <IRQ> [<ffffffff80250863>] __report_bad_irq+0x30/0x72
Jan 11 09:45:43 [kernel] [<ffffffff80250a91>] note_interrupt+0x1ec/0x22b
Jan 11 09:45:43 [kernel] [<ffffffff80250eb9>] handle_fasteoi_irq+0x70/0x87
Jan 11 09:45:43 [kernel] [<ffffffff8021df82>] do_IRQ+0x52/0xa6
Jan 11 09:45:43 [kernel] [<ffffffff8021bc11>] ret_from_intr+0x0/0xa
Jan 11 09:45:43 [kernel] [<ffffffff802281a3>] ack_apic_level+0x0/0xc
Jan 11 09:45:43 [kernel] [<ffffffff802377cf>] __do_softirq+0x3b/0x90
Jan 11 09:45:43 [kernel] [<ffffffff8021c3bc>] call_softirq+0x1c/0x28
Jan 11 09:45:43 [kernel] [<ffffffff8021dc6c>] do_softirq+0x2c/0x68
Jan 11 09:45:43 [kernel] [<ffffffff8021dfc6>] do_IRQ+0x96/0xa6
Jan 11 09:45:43 [kernel] [<ffffffff8021bc11>] ret_from_intr+0x0/0xa
Jan 11 09:45:43 [kernel] <EOI> [<ffffffff80481954>] unix_poll+0x0/0xa4
Jan 11 09:45:43 [kernel] [<ffffffff8021a93b>] default_idle+0x2e/0x52
Jan 11 09:45:43 [kernel] [<ffffffff8021a936>] default_idle+0x29/0x52
Jan 11 09:45:43 [kernel] [<ffffffff8021a90d>] default_idle+0x0/0x52
Jan 11 09:45:43 [kernel] [<ffffffff8021a992>] cpu_idle+0x33/0x51
Jan 11 09:45:43 [kernel] handlers:
Jan 11 09:45:43 [kernel] [<ffffffff8037b755>] (rhine_interrupt+0x0/0x681)

Then the following error shows up about once an hour after that:

Jan 11 10:35:53 [kernel] NETDEV WATCHDOG: eth0: transmit timed out
Jan 11 10:35:53 [kernel] eth0: Transmit timed out, status 1003, PHY status 786d, resetting...
Jan 11 10:35:53 [kernel] eth0: link up, 100Mbps, full-duplex, lpa 0x45E1
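To see how often each of the two symptoms quoted above turns up, the log can be counted with grep. This is only a rough sketch: the function name summarize_nic_errors is made up for illustration, and the log path in the usage comment is an assumption that depends on which system logger is installed.

```shell
#!/bin/sh
# summarize_nic_errors LOGFILE
# Counts the two kernel messages quoted in this thread.
# (Hypothetical helper name; not part of any package.)
summarize_nic_errors() {
    log=$1
    # grep -c prints the number of matching lines; "|| true" keeps a
    # zero count from being treated as a failure.
    irq=$(grep -c 'nobody cared' "$log" || true)
    tx=$(grep -c 'NETDEV WATCHDOG: eth0: transmit timed out' "$log" || true)
    echo "unhandled-irq reports: $irq"
    echo "eth0 transmit timeouts: $tx"
}

# Usage (the path is an assumption; adjust to the local logger setup):
#   summarize_nic_errors /var/log/messages
```

If the unhandled-IRQ report appears once and the transmit timeouts only start after it, that would be consistent with the order of events described here.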

When I run '/etc/init.d/net.eth0 restart', it comes up fine and I get the following in the logs:

Jan 11 14:58:47 [kernel] NETDEV WATCHDOG: eth0: transmit timed out
Jan 11 14:58:47 [kernel] eth0: Transmit timed out, status 0003, PHY status 786d, resetting...
Jan 11 14:58:47 [kernel] eth0: link up, 100Mbps, full-duplex, lpa 0x45E1
Jan 11 14:59:32 [su] Successful su for root by macleod
Jan 11 14:59:32 [su] + pts/2 macleod:root
Jan 11 14:59:32 [su] pam_unix(su:session): session opened for user root by (uid=1001)
Jan 11 15:00:01 [cron] (root) CMD (test -x /usr/sbin/run-crons && /usr/sbin/run-crons )
Jan 11 15:00:01 [cron] (root) CMD (rm -f /var/spool/cron/lastrun/cron.hourly)
Jan 11 15:00:26 [rc-scripts] WARNING: you are stopping a boot service.
Jan 11 15:00:26 [sshd] Received signal 15; terminating.
Jan 11 15:00:27 [dhcpcd] eth0: received SIGTERM, stopping
Jan 11 15:00:28 [kernel] eth0: link up, 100Mbps, full-duplex, lpa 0x45E1
Jan 11 15:00:28 [dhcpcd] eth0: dhcpcd 4.0.2 starting
Jan 11 15:00:28 [dhcpcd] eth0: broadcasting for a lease
Jan 11 15:00:28 [dhcpcd] eth0: offered 10.236.143.115 from 10.236.143.113
Jan 11 15:00:28 [dhcpcd] eth0: checking 10.236.143.115 is available on attached networks
Jan 11 15:00:33 [dhcpcd] eth0: acknowledged 10.236.143.115 from 10.236.143.113
Jan 11 15:00:34 [dhcpcd] eth0: leased 10.236.143.115 for 86400 seconds
Jan 11 15:00:34 [dhcdbd] Started up.
Jan 11 15:00:34 [sshd] Server listening on :: port 22.
Jan 11 15:00:34 [sshd] Server listening on 0.0.0.0 port 22.

After that, I'm working fine.

Here is the output of lspci:

00:00.0 Host bridge: VIA Technologies, Inc. K8T890 Host Bridge
00:00.1 Host bridge: VIA Technologies, Inc. K8T890 Host Bridge
00:00.2 Host bridge: VIA Technologies, Inc. K8T890 Host Bridge
00:00.3 Host bridge: VIA Technologies, Inc. K8T890 Host Bridge
00:00.4 Host bridge: VIA Technologies, Inc. K8T890 Host Bridge
00:00.5 PIC: VIA Technologies, Inc. K8T890 I/O APIC Interrupt Controller
00:00.7 Host bridge: VIA Technologies, Inc. K8T890 Host Bridge
00:01.0 PCI bridge: VIA Technologies, Inc. VT8237 PCI bridge [K8T800/K8T890 South]
00:02.0 PCI bridge: VIA Technologies, Inc. K8T890 PCI to PCI Bridge Controller
00:03.0 PCI bridge: VIA Technologies, Inc. K8T890 PCI to PCI Bridge Controller
00:03.1 PCI bridge: VIA Technologies, Inc. K8T890 PCI to PCI Bridge Controller
00:03.2 PCI bridge: VIA Technologies, Inc. K8T890 PCI to PCI Bridge Controller
00:03.3 PCI bridge: VIA Technologies, Inc. K8T890 PCI to PCI Bridge Controller
00:0f.0 IDE interface: VIA Technologies, Inc. VIA VT6420 SATA RAID Controller (rev 80)
00:0f.1 IDE interface: VIA Technologies, Inc. VT82C586A/B/VT82C686/A/B/VT823x/A/C PIPC Bus Master IDE (rev 06)
00:10.0 USB Controller: VIA Technologies, Inc. VT82xxxxx UHCI USB 1.1 Controller (rev 81)
00:10.1 USB Controller: VIA Technologies, Inc. VT82xxxxx UHCI USB 1.1 Controller (rev 81)
00:10.2 USB Controller: VIA Technologies, Inc. VT82xxxxx UHCI USB 1.1 Controller (rev 81)
00:10.3 USB Controller: VIA Technologies, Inc. VT82xxxxx UHCI USB 1.1 Controller (rev 81)
00:10.4 USB Controller: VIA Technologies, Inc. USB 2.0 (rev 86)
00:11.0 ISA bridge: VIA Technologies, Inc. VT8237 ISA bridge [KT600/K8T800/K8T890 South]
00:11.5 Multimedia audio controller: VIA Technologies, Inc. VT8233/A/8235/8237 AC97 Audio Controller (rev 60)
00:12.0 Ethernet controller: VIA Technologies, Inc. VT6102 [Rhine-II] (rev 78)
00:18.0 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration
00:18.1 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map
00:18.2 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller
00:18.3 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control
02:00.0 VGA compatible controller: nVidia Corporation NV44 [GeForce 6200 TurboCache(TM)] (rev a1)

I am running the '2.6.25-gentoo-r7' kernel, not a genkernel, for amd64.

I hope someone else out there either has seen this and knows what's going on, or at least is willing to help me solve this. Let me know if there is something I'm forgetting to post that might be helpful (the rest of the log, as far as I could see, was just a bunch of cron jobs running, and none running right before this happened). I appreciate any help.
honp
Guru


Joined: 25 Sep 2006
Posts: 355
Location: Good old Prague, Czech rep.

Posted: Mon Jan 12, 2009 8:44 am    Post subject:

Hi, it looks like a kernel bug to me; you could report it...

If you read the message, it says:
Code:
Jan 11 09:45:43 [kernel] irq 23: nobody cared (try booting with the "irqpoll" option)

so try booting with the "irqpoll" option.

Otherwise, the problem is that your card somehow crashes and forgets its IP settings. An ugly workaround would be to run, for example, dhclient eth0 every hour or so.
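As a stopgap along those lines, a small watchdog could be run from root's crontab so the interface is only restarted when the network is actually unreachable. The following is just a sketch under assumptions: the function name check_and_restart and the script path in the crontab comment are made up for illustration, 10.236.143.113 is the DHCP server from the log above and may not be the right host to ping on this network, and the restart command is the one the original poster reported as working. It does not address the underlying IRQ problem.

```shell
#!/bin/sh
# check_and_restart HOST COMMAND [ARGS...]
# Sends one ping to HOST; if there is no reply, logs a note and runs
# COMMAND. (Hypothetical helper name; not part of any package.)
check_and_restart() {
    host=$1
    shift
    # One echo request, wait at most 5 seconds for the reply.
    if ping -c 1 -W 5 "$host" >/dev/null 2>&1; then
        return 0
    fi
    logger -t net-watchdog "no reply from $host, running: $*"
    "$@"
}

# Possible call at the end of the script (host is an assumption):
#   check_and_restart 10.236.143.113 /etc/init.d/net.eth0 restart
#
# Possible root crontab entry (script path is a placeholder):
#   */10 * * * * /usr/local/sbin/net-watchdog.sh
```

Note that in the log above, restarting net.eth0 also stopped and restarted sshd, so any open SSH sessions would likely be dropped each time the watchdog fires.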
laurelraven
n00b


Joined: 12 Sep 2007
Posts: 17

Posted: Mon Jan 12, 2009 4:22 pm    Post subject:

Okay, I'm not at all sure I'm doing this right, and unfortunately I probably won't know if it worked for a while (sometimes it doesn't have this issue for a week or more), but here goes:

I had noticed the "irqpoll" note in the error logs, but thought that it would only have made a difference if it were happening during boot up... I'll give it a try, though. I did not know how to add that as a boot option, but some substantial Google searching has led me to one person saying that you just add it to the end of the kernel line in GRUB.

I've added that, and am going to reboot. I'll post here if it doesn't work, and if it stays going fine for a while, I'll post that too.

Thanks!
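For anyone following along, one way to confirm after the reboot that the option actually reached the kernel is to look at /proc/cmdline. This is a minimal sketch: the function name has_boot_option is made up for illustration, and the kernel path and root device in the GRUB comment are placeholders, not values taken from this thread.

```shell
#!/bin/sh
# Hypothetical GRUB legacy entry in /boot/grub/grub.conf, with the option
# appended to the end of the kernel line (path and root device are
# placeholders):
#   kernel /boot/kernel-2.6.25-gentoo-r7 root=/dev/sda3 irqpoll

# has_boot_option OPTION [CMDLINE_FILE]
# Succeeds if OPTION appears as a whole word on the kernel command line.
# (Hypothetical helper name; CMDLINE_FILE defaults to /proc/cmdline.)
has_boot_option() {
    opt=$1
    file=${2:-/proc/cmdline}
    # Put each word on its own line, then look for an exact, literal match.
    tr ' ' '\n' < "$file" | grep -qxF -- "$opt"
}

# Usage after rebooting:
#   has_boot_option irqpoll && echo "irqpoll is active"
```

Matching whole words avoids a false positive from some other option that merely contains the same text.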
All times are GMT