View previous topic :: View next topic |
Author |
Message |
grant123 Veteran
Joined: 23 Mar 2005 Posts: 1099
|
Posted: Tue Jun 04, 2024 12:34 pm Post subject: Rebooting on its own |
|
|
My Gentoo system has started rebooting on its own, frequently. There doesn't seem to be anything interesting in /var/log/messages. How should I look into this? |
|
Back to top |
|
|
lekto Apprentice
Joined: 20 Sep 2014 Posts: 207 Location: Ancient Rome
|
Posted: Tue Jun 04, 2024 2:19 pm Post subject: |
|
|
It is summer now (at least in the northern hemisphere); your PC might just overheat, so check temperatures. Also, try to run LiveGUI USB to make sure it isn't your system's fault. |
|
Back to top |
|
|
grant123 Veteran
Joined: 23 Mar 2005 Posts: 1099
|
Posted: Tue Jun 04, 2024 2:39 pm Post subject: |
|
|
I moved the SSD to another system and the behavior is the same unfortunately. Both systems are quite cool to the touch. |
|
Back to top |
|
|
lekto Apprentice
Joined: 20 Sep 2014 Posts: 207 Location: Ancient Rome
|
Posted: Tue Jun 04, 2024 2:46 pm Post subject: |
|
|
It looks like there might be a problem with the system or the SSD.
Cold to the touch might also mean overheating if there is a poor connection between a CPU and a heatsink, but since both machines are cool, it probably isn't. |
|
Back to top |
|
|
grant123 Veteran
Joined: 23 Mar 2005 Posts: 1099
|
Posted: Tue Jun 04, 2024 3:54 pm Post subject: |
|
|
This started happening after a fairly large emerge world update the other day so at this point I'm certain it's software. There must be some way to track it down? I would try a newer kernel but the system won't even stay up long enough to compile it. |
|
Back to top |
|
|
lekto Apprentice
Joined: 20 Sep 2014 Posts: 207 Location: Ancient Rome
|
Posted: Tue Jun 04, 2024 4:09 pm Post subject: |
|
|
I had a "few times" similar issue when something broke after a system update, and I didn't know which package caused that. Run genlop -l and check what has been updated since the last time your system was working flawlessly, and mask them. Downgrade everything Portage wants. Now, make sure your system is stable, then start unmasking and updating packages and testing if your system is stable. After updating some packages, it might be necessary to reboot your machine.
This method always worked for me. If by change this "fairly large emerge world update" was emerge --emptytree @world after changing profile, I think it would be better to restore the system from backup and do emerge --emptytree @world a few packages at the time. |
|
Back to top |
|
|
Hu Administrator
Joined: 06 Mar 2007 Posts: 22598
|
Posted: Tue Jun 04, 2024 4:44 pm Post subject: |
|
|
Did that recent large update include a new kernel? If so, start by dropping back to the kernel from before the update. |
|
Back to top |
|
|
Banana Moderator
Joined: 21 May 2004 Posts: 1711 Location: Germany
|
|
Back to top |
|
|
grant123 Veteran
Joined: 23 Mar 2005 Posts: 1099
|
Posted: Wed Aug 07, 2024 5:45 pm Post subject: |
|
|
I'm still having this problem unfortunately.
I set up lm-sensors and everything is between 22C and 35C including the NVME sensor. Is there a bash one-liner that will dump 'sensors' to a file every 10 seconds so I can check the most recent readings after it reboots?
There must be a way to gather more info on why a system rebooted? |
|
Back to top |
|
|
grant123 Veteran
Joined: 23 Mar 2005 Posts: 1099
|
Posted: Wed Aug 07, 2024 8:16 pm Post subject: |
|
|
After more testing I think this may be due to a USB network adapter overheating and freezing/rebooting the system. Still testing... |
|
Back to top |
|
|
grant123 Veteran
Joined: 23 Mar 2005 Posts: 1099
|
Posted: Wed Aug 07, 2024 10:05 pm Post subject: |
|
|
I tried disabling power save mode on the USB adapter like this but it still reboots:
Code: | echo "on" > "/sys/class/net/eth0/power/control" |
|
|
Back to top |
|
|
Banana Moderator
Joined: 21 May 2004 Posts: 1711 Location: Germany
|
|
Back to top |
|
|
AprilGrimoire n00b
Joined: 05 Jun 2020 Posts: 57
|
Posted: Wed Aug 14, 2024 7:08 pm Post subject: Re: Rebooting on its own |
|
|
grant123 wrote: | My Gentoo system has started rebooting on its own, frequently. There doesn't seem to be anything interesting in /var/log/messages. How should I look into this? |
Is it a Gentoo-specific bug, or does it occur on every distrio or even Windows? I had a similar issue with Archlinux recently, and initially I thought it was an Arch issue. However, later it happened more and more frequently and I finally couldn't bear it anymore. I had a test environment Ubuntu installation on a portable HDD, and run stress tests from there. Eventually I found that this issue happened on every Linux distrio, even WSL, and even native Windows applications some times. If you don't have extra disc space, you can use a portable live image to rule out the possibility of a hardware issue.
BTW in my case the issue could be reproduced (randomly, not every time) by plugging and unplugging the charging cable repeatedly when the battery has less than half the charge and cpu is under stress. |
|
Back to top |
|
|
|