piewie Guru
Joined: 26 Sep 2002 Posts: 494 Location: old Europe
Posted: Sat Oct 05, 2024 6:55 pm Post subject: system start hangs with gentoo on nvme M.2 [solved]
I am booting from a USB stick and opening a fully encrypted / partition. The current kernel is gentoo-sources 6.10.10. I have one / partition on an SSD and one on an NVMe M.2 disk. I can start my system with the root partition on the SSD, but with the one on the NVMe disk it hangs. The cryptsetup luksOpen dialog worked fine (same kernel). The NVMe disk generally works when the system is started from the SSD. The NVMe disk is directly connected (I suppose via S4LV008) to the CPU on a Gigabyte Z790D.
Code: | 0000:00:0e.0 RAID bus controller: Intel Corporation Volume Management Device NVMe RAID Controller
10000:e1:00.0 Non-Volatile memory controller: Samsung Electronics Co Ltd NVMe SSD Controller S4LV008[Pascal] (prog-if 02 [NVM Express])
Subsystem: Samsung Electronics Co Ltd Device a801
Flags: bus master, fast devsel, latency 0
Memory at 58000000 (64-bit, non-prefetchable) [size=16K]
Capabilities: [40] Power Management version 3
Capabilities: [50] MSI: Enable- Count=1/32 Maskable- 64bit+
Capabilities: [70] Express Endpoint, IntMsgNum 0
Capabilities: [b0] MSI-X: Enable+ Count=17 Masked-
Capabilities: [100] Advanced Error Reporting
Capabilities: [168] Secondary PCI Express
Capabilities: [188] Physical Layer 16.0 GT/s <?>
Capabilities: [1ac] Lane Margining at the Receiver
Capabilities: [1c4] Latency Tolerance Reporting
Capabilities: [1cc] L1 PM Substates
Capabilities: [350] Data Link Feature <?>
Kernel driver in use: nvme
|
syslog:
Code: | kernel: Non-volatile memory driver v1.3
kernel: ahci 10000:e0:17.0: version 3.0
kernel: ahci 10000:e0:17.0: can't derive routing for PCI INT A
kernel: nvme nvme0: pci function 10000:e1:00.0
kernel: ahci 10000:e0:17.0: PCI INT A: no GSI
kernel: pcieport 10000:e0:06.0: can't derive routing for PCI INT A
kernel: nvme 10000:e1:00.0: PCI INT A: no GSI
kernel: nvme nvme0: D3 entry latency set to 10 seconds
kernel: nvme nvme0: 16/0/0 default/read/poll queues
kernel: Loading firmware: rtl_nic/rtl8125b-2.fw
kernel: RTL8226B_RTL8221B 2.5Gbps PHY r8169-0-300:00: attached PHY driver (mii_bus:phy_addr=r8169-0-300:00, irq=MAC)
kernel: [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:01:00.0 on minor 0
kernel: ip (4191) used greatest stack depth: 11408 bytes left
kernel: r8169 0000:03:00.0 enp4s0: Link is Down
kernel: r8169 0000:03:00.0 enp4s0: Link is Up - 1Gbps/Full - flow control rx/tx
kernel: (udev-worker) (3257) used greatest stack depth: 10520 bytes left
|
On the monitor, the boot process stops at /etc/init.d/procfs. I can't ssh into or ping the system yet.
rc.log:
Code: | rc default logging started at Sat Oct 5 02:35:38 2024
* Executing: /lib/rc/sh/openrc-run.sh /lib/rc/sh/openrc-run.sh /etc/init.d/chronyd start
* Starting chronyd ...
* start-stop-daemon: fopen `/run/chrony/chronyd.pid': No such file or directory
* Detaching to start `/usr/sbin/chronyd' ...
[ ok ]
* Executing: /lib/rc/sh/openrc-run.sh /lib/rc/sh/openrc-run.sh /etc/init.d/sysklogd start
* Starting sysklogd ...
* start-stop-daemon: fopen `/run/sysklogd.pid': No such file or directory
* Detaching to start `/usr/sbin/syslogd' ...
[ ok ]
* Executing: /lib/rc/sh/openrc-run.sh /lib/rc/sh/openrc-run.sh /etc/init.d/cronie start
* Starting cronie ...
* start-stop-daemon: fopen `/run/crond.pid': No such file or directory
* Detaching to start `/usr/sbin/crond' ...
[ ok ]
* Executing: /lib/rc/sh/openrc-run.sh /lib/rc/sh/openrc-run.sh /etc/init.d/net.enp4s0 start
* Bringing up interface enp4s0
* Caching network module dependencies
* Skipping module adsl due to missing program: adsl-start pppoe-start
* Skipping module br2684ctl due to missing program: br2684ctl
* Skipping module clip due to missing program: atmsigd
* Skipping module ethtool due to missing program: ethtool
* Skipping module netplugd due to missing program: netplugd
* Skipping module ifplugd due to missing program: ifplugd
* Skipping module ipppd due to missing program: ipppd
* Skipping module iwconfig due to missing program: iwconfig
* Skipping module iwd due to missing program: /usr/libexec/iwd
* Skipping module iw due to missing program: iw
* Skipping module firewalld due to missing program: firewall-cmd
* Skipping module pppd due to missing program: pppd
* Skipping module udhcpc due to missing program: busybox
* Skipping module dhclient due to missing program: dhclient
* Skipping module dhclientv6 due to missing program: dhclient
* Skipping module wireguard due to missing program: wg
* Loaded modules: apipa arping bonding l2tp tuntap bridge ccwgroup dummy hsr macvlan macchanger macnet qmi wpa_supplicant ssidnet iproute2 system vlan dhcpcd ip6rd ip6to4 veth
* ip link set dev enp4s0 up
* Configuring enp4s0 for MAC address 74:56 ...
[ ok ]
[ ok ]
* ip -4 route flush table cache dev enp4s0
* ip -6 route flush table cache dev enp4s0
* Executing: /lib/rc/sh/openrc-run.sh /lib/rc/sh/openrc-run.sh /etc/init.d/netmount start
* Mounting network filesystems ...
[ ok ]
* Executing: /lib/rc/sh/openrc-run.sh /lib/rc/sh/openrc-run.sh /etc/init.d/sshd start
* Starting sshd ...
* start-stop-daemon: fopen `/run/sshd.pid': No such file or directory
* Detaching to start `/usr/sbin/sshd' ...
[ ok ]
[ ok ]
* Executing: /lib/rc/sh/openrc-run.sh /lib/rc/sh/openrc-run.sh /etc/init.d/local start
* Starting local ...
[ ok ]
rc default logging stopped at Sat Oct 5 02:35:40 2024 |
nvme id-ctrl -H /dev/nvme0:
Code: | NVME Identify Controller:
vid : 0x144d
ssvid : 0x144d
sn : S7DNNU0X576222A
mn : Samsung SSD 990 PRO 2TB
fr : 4B2QJXD7
rab : 2
ieee : 002538
cmic : 0
[3:3] : 0 ANA not supported
[2:2] : 0 PCI
[1:1] : 0 Single Controller
[0:0] : 0 Single Port
mdts : 9
cntlid : 0x1
ver : 0x20000
rtd3r : 0x30d40
rtd3e : 0x989680
oaes : 0x200
[31:31] : 0 Discovery Log Change Notice Not Supported
[27:27] : 0 Zone Descriptor Changed Notices Not Supported
[15:15] : 0 Normal NSS Shutdown Event Not Supported
[14:14] : 0 Endurance Group Event Aggregate Log Page Change Notice Not Supported
[13:13] : 0 LBA Status Information Notices Not Supported
[12:12] : 0 Predictable Latency Event Aggregate Log Change Notices Not Supported
[11:11] : 0 Asymmetric Namespace Access Change Notices Not Supported
[9:9] : 0x1 Firmware Activation Notices Supported
[8:8] : 0 Namespace Attribute Changed Event Not Supported
ctratt : 0x10
[19:19] : 0 Flexible Data Placement Not Supported
[15:15] : 0 Extended LBA Formats Not Supported
[14:14] : 0 Delete NVM Set Not Supported
[13:13] : 0 Delete Endurance Group Not Supported
[12:12] : 0 Variable Capacity Management Not Supported
[11:11] : 0 Fixed Capacity Management Not Supported
[10:10] : 0 Multi Domain Subsystem Not Supported
[9:9] : 0 UUID List Not Supported
[8:8] : 0 SQ Associations Not Supported
[7:7] : 0 Namespace Granularity Not Supported
[6:6] : 0 Traffic Based Keep Alive Not Supported
[5:5] : 0 Predictable Latency Mode Not Supported
[4:4] : 0x1 Endurance Groups Supported
[3:3] : 0 Read Recovery Levels Not Supported
[2:2] : 0 NVM Sets Not Supported
[1:1] : 0 Non-Operational Power State Permissive Not Supported
[0:0] : 0 128-bit Host Identifier Not Supported
rrls : 0
cntrltype : 1
[7:2] : 0 Reserved
[1:0] : 0x1 I/O Controller
fguid : 00000000-0000-0000-0000-000000000000
crdt1 : 0
crdt2 : 0
crdt3 : 0
nvmsr : 0
[1:1] : 0 NVM subsystem Not part of an Enclosure
[0:0] : 0 NVM subsystem Not part of a Storage Device
vwci : 0
[7:7] : 0 VPD Write Cycles Remaining field is Not valid.
[6:0] : 0 VPD Write Cycles Remaining
mec : 0
[1:1] : 0 NVM subsystem Not contains a Management Endpoint on a PCIe port
[0:0] : 0 NVM subsystem Not contains a Management Endpoint on an SMBus/I2C port
oacs : 0x17
[10:10] : 0 Lockdown Command and Feature Not Supported
[9:9] : 0 Get LBA Status Capability Not Supported
[8:8] : 0 Doorbell Buffer Config Not Supported
[7:7] : 0 Virtualization Management Not Supported
[6:6] : 0 NVMe-MI Send and Receive Not Supported
[5:5] : 0 Directives Not Supported
[4:4] : 0x1 Device Self-test Supported
[3:3] : 0 NS Management and Attachment Not Supported
[2:2] : 0x1 FW Commit and Download Supported
[1:1] : 0x1 Format NVM Supported
[0:0] : 0x1 Security Send and Receive Supported
acl : 7
aerl : 3
frmw : 0x16
[5:5] : 0 Multiple FW or Boot Update Detection Not Supported
[4:4] : 0x1 Firmware Activate Without Reset Supported
[3:1] : 0x3 Number of Firmware Slots
[0:0] : 0 Firmware Slot 1 Read/Write
lpa : 0x2f
[6:6] : 0 Telemetry Log Data Area 4 Not Supported
[5:5] : 0x1 LID 0x0, Scope of each command in LID 0x5, 0x12, 0x13 Supported
[4:4] : 0 Persistent Event log Not Supported
[3:3] : 0x1 Telemetry host/controller initiated log page Supported
[2:2] : 0x1 Extended data for Get Log Page Supported
[1:1] : 0x1 Command Effects Log Page Supported
[0:0] : 0x1 SMART/Health Log Page per NS Supported
elpe : 63
[7:0] : 63 (0's based) Error Log Page Entries (ELPE)
npss : 4
[7:0] : 4 (0's based) Number of Power States Support (NPSS)
avscc : 0x1
[0:0] : 0x1 Admin Vendor Specific Commands uses NVMe Format
apsta : 0x1
[0:0] : 0x1 Autonomous Power State Transitions Supported
wctemp : 355
[15:0] : 82 °C (355 K) Warning Composite Temperature Threshold (WCTEMP)
cctemp : 358
[15:0] : 85 °C (358 K) Critical Composite Temperature Threshold (CCTEMP)
mtfa : 0
hmpre : 0
hmmin : 0
tnvmcap : 2.000.398.934.016
[127:0] : 2.000.398.934.016
Total NVM Capacity (TNVMCAP)
unvmcap : 0
[127:0] : 0
Unallocated NVM Capacity (UNVMCAP)
rpmbs : 0
[31:24]: 0 Access Size
[23:16]: 0 Total Size
[5:3] : 0 Authentication Method
[2:0] : 0 Number of RPMB Units
edstt : 35
dsto : 0
fwug : 0
kas : 0
hctma : 0x1
[0:0] : 0x1 Host Controlled Thermal Management Supported
mntmt : 318
[15:0] : 45 °C (318 K) Minimum Thermal Management Temperature (MNTMT)
mxtmt : 356
[15:0] : 83 °C (356 K) Maximum Thermal Management Temperature (MXTMT)
sanicap : 0x60000003
[31:30] : 0x1 Media is not additionally modified after sanitize operation completes successfully
[29:29] : 0x1 No-Deallocate After Sanitize bit in Sanitize command Not Supported
[2:2] : 0 Overwrite Sanitize Operation Not Supported
[1:1] : 0x1 Block Erase Sanitize Operation Supported
[0:0] : 0x1 Crypto Erase Sanitize Operation Supported
hmminds : 0
hmmaxd : 0
nsetidmax : 0
endgidmax : 1
anatt : 0
anacap : 0
[7:7] : 0 Non-zero group ID Not Supported
[6:6] : 0 Group ID does change
[4:4] : 0 ANA Change state Not Supported
[3:3] : 0 ANA Persistent Loss state Not Supported
[2:2] : 0 ANA Inaccessible state Not Supported
[1:1] : 0 ANA Non-optimized state Not Supported
[0:0] : 0 ANA Optimized state Not Supported
anagrpmax : 0
nanagrpid : 0
pels : 0
domainid : 0
megcap : 0
sqes : 0x66
[7:4] : 0x6 Max SQ Entry Size (64)
[3:0] : 0x6 Min SQ Entry Size (64)
cqes : 0x44
[7:4] : 0x4 Max CQ Entry Size (16)
[3:0] : 0x4 Min CQ Entry Size (16)
maxcmd : 256
nn : 1
oncs : 0x55
[10:10] : 0 All Fast Copy Not Supported
[9:9] : 0 Copy Single Atomicity Not Supported
[8:8] : 0 Copy Not Supported
[7:7] : 0 Verify Not Supported
[6:6] : 0x1 Timestamp Supported
[5:5] : 0 Reservations Not Supported
[4:4] : 0x1 Save and Select Supported
[3:3] : 0 Write Zeroes Not Supported
[2:2] : 0x1 Data Set Management Supported
[1:1] : 0 Write Uncorrectable Not Supported
[0:0] : 0x1 Compare Supported
fuses : 0
[0:0] : 0 Fused Compare and Write Not Supported
fna : 0x7
[3:3] : 0 Format NVM Broadcast NSID (FFFFFFFFh) Supported
[2:2] : 0x1 Crypto Erase Supported as part of Secure Erase
[1:1] : 0x1 Crypto Erase Applies to All Namespace(s)
[0:0] : 0x1 Format Applies to All Namespace(s)
vwc : 0x7
[2:1] : 0x3 The Flush command supports NSID set to FFFFFFFFh
[0:0] : 0x1 Volatile Write Cache Present
awun : 1023
awupf : 0
icsvscc : 1
[0:0] : 0x1 NVM Vendor Specific Commands uses NVMe Format
nwpc : 0
[2:2] : 0 Permanent Write Protect Not Supported
[1:1] : 0 Write Protect Until Power Supply Not Supported
[0:0] : 0 No Write Protect and Write Protect Namespace Not Supported
acwu : 0
ocfs : 0
[3:3] : 0 Controller Copy Format 3h Not Supported
[2:2] : 0 Controller Copy Format 2h Not Supported
[1:1] : 0 Controller Copy Format 1h Not Supported
[0:0] : 0 Controller Copy Format 0h Not Supported
sgls : 0
[15:8] : 0 SGL Descriptor Threshold
[1:0] : 0 Scatter-Gather Lists Not Supported
mnan : 0
maxdna : 0
maxcna : 0
oaqd : 0
subnqn : nqn.1994-11.com.samsung:nvme:990PRO:M.2:S7DNNU0X576222A
ioccsz : 0
iorcsz : 0
icdoff : 0
fcatt : 0
[0:0] : 0 Dynamic Controller Model
msdbd : 0
ofcs : 0
[0:0] : 0 Disconnect command Not Supported
ps 0 : mp:9.39W operational enlat:0 exlat:0 rrt:0 rrl:0
rwt:0 rwl:0 idle_power:- active_power:-
active_power_workload:-
ps 1 : mp:9.39W operational enlat:0 exlat:0 rrt:1 rrl:1
rwt:1 rwl:1 idle_power:- active_power:-
active_power_workload:-
ps 2 : mp:9.39W operational enlat:0 exlat:0 rrt:2 rrl:2
rwt:2 rwl:2 idle_power:- active_power:-
active_power_workload:-
ps 3 : mp:0.0400W non-operational enlat:4200 exlat:2700 rrt:3 rrl:3
rwt:3 rwl:3 idle_power:- active_power:-
active_power_workload:-
ps 4 : mp:0.0050W non-operational enlat:500 exlat:21800 rrt:4 rrl:4
rwt:4 rwl:4 idle_power:- active_power:-
active_power_workload:-
|
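As a cross-check, the rtd3e value in the listing above appears to match the "D3 entry latency set to 10 seconds" kernel message in the syslog. A minimal sketch of the arithmetic, assuming (per the NVMe specification) that rtd3e and rtd3r are given in microseconds; the variable names are only for illustration:

```shell
# rtd3e and rtd3r copied from the id-ctrl listing; assumed to be microseconds.
rtd3e_us=$((0x989680))
rtd3r_us=$((0x30d40))

# Entry latency in whole seconds, resume latency in milliseconds.
rtd3e_s=$((rtd3e_us / 1000000))
rtd3r_ms=$((rtd3r_us / 1000))

echo "rtd3e: ${rtd3e_us} us = ${rtd3e_s} s"
echo "rtd3r: ${rtd3r_us} us = ${rtd3r_ms} ms"
```

So the 10 second figure looks like a value reported by the drive itself rather than a sign of a fault.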
What can be done here?
Last edited by piewie on Fri Oct 11, 2024 7:52 pm; edited 3 times in total |
piewie Guru
Posted: Sun Oct 06, 2024 6:40 pm
There seems to be a lockup for several seconds after booting. After a few seconds of pressing <CTRL>+<C> and pressing <ENTER> several times, the login prompt suddenly appears at the top of the monitor without the old OpenRC messages being cleared. I can log in now, but a bit later the system locks up again, and so far I have not been able to get back to the command line.
piewie Guru
Posted: Sun Oct 06, 2024 8:38 pm
The network is fixed. The device name enpXs0 was changing between reboots, so I added a udev rule that pins the network device to enp3s0. SSH access works without a lockup when the system is started from the NVMe disk, so it is probably a graphics card / monitor issue now. I have disabled the framebuffer in the kernel, but this did not help. I can't switch between console sessions with <CTRL>+<ALT>+<Fx>.
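The actual udev rule is not shown in this thread. A minimal sketch of what such a rule might look like, assuming the interface is matched by its MAC address: the helper name make_net_rule, the MAC address and the rules file path are all made up for illustration.

```shell
# Hypothetical helper: print a udev rule that pins an interface name to a MAC address.
# $1 = MAC address (placeholder, the real one is not shown in full in this thread)
# $2 = desired interface name
make_net_rule() {
    printf 'SUBSYSTEM=="net", ACTION=="add", ATTR{address}=="%s", NAME="%s"\n' "$1" "$2"
}

# Example with a made-up MAC address. The output could be redirected into a rules
# file such as /etc/udev/rules.d/70-persistent-net.rules (path is an assumption).
make_net_rule "00:11:22:33:44:55" "enp3s0"
```

Matching on the MAC address is only one option; the rule takes effect the next time the device is added, for example after a reboot.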
piewie Guru
Posted: Tue Oct 08, 2024 5:31 pm
I added EDID to the kernel for monitor detection. The VESA framebuffer is doing its job now. I can switch consoles with <CTRL>+<ALT>+<Fx>, but neither monitor comes back from suspend.
piewie Guru
Posted: Fri Oct 11, 2024 7:52 pm
Since the last reboot, both monitors have been behaving well. I think everything is solved now.